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‘  A  detailed  study  of  the  implementation  of  finite  element  methods  j 

for  solving  two-dimensional  elliptic  partial  differential  equations  is 
presented.  Generation  and  storage  schemes  for  triangular  meshes  are 
considered,  and  the  use  of  irregular  meshes  for  finite  element  methods 
is  shown  to  be  relatively  inexpensive  in  terms  of  storage,  ^’demonstrate 
that  much  of  the  manipulation  of  the  basis  functions  necessary  in  the 
derivation  of  the  approximate  equations  can  be  done  semi-symbolically 
rather  than  numerically  as  is  usually  done.  Ordering  algorithms,  compact 
storage  schemes,  and  efficient  implementation  of  elimination  methods  are 
studied  in  connection  with  sparse  systems  of  finite  element  equations . 

A  Fortran  code  is  included  for  the  finite  element  solution  of  a  class 
of  elliptic  boundary  value  problems,  and  numerical  solutions  of  several 
problems  are  presented.  Comparisons  among  different  finite  element 
methods,  and  between  finite  element  methods  and  their  competitors  are 
included. 
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CHAPTER  1 


INTRODUCTION 


1.  Aims  of  the  Thesis 

Our  main  goal  in  this  thesis  is  a  detailed  study  of  the  implementation 
of  finite  element  methods  for  solving  linear  elliptic  partial  differential 
equations  in  two  dimensions.  Our  study  is  restricted  to  problems  which 
can  be  formulated  as  finding  the  stationary  values  of  a  quadratic  integral 
over  a  given  class  of  functions.  Thus,  we  consider  inhomogeneous  second  order 
elliptic  boundary  value  problems  in  the  plane  which  are  either  formulated 
as  least  squares  problems  or  can  be  placed  in  variational  form.  In  the 
text  we  consider  equations  with  variable  coefficients  and  problems 
involving  boundary  integrals,  although  the  Fortran  code  we  actually 
present  can  handle  a  less  general  class  of  integrals.  However,  the 
majority  of  the  program  would  remain  unchanged  for  more  general  problems. 

Our  viewpoint  will  not  be  that  of  a  person  who  wishes  to  solve  a 
specific  problem.  Instead,  we  will  adopt  the  attitude  of  one  who  must 
provide  a  general  program  which  is  efficient,  easy  to  use,  and  applicable 
to  a  reasonably  large  subclass  of  two  dimensional  linear  elliptic  boundary 
value  problems.  Thus,  the  capability  of  handling  odd-shaped  domains  and 
general  (non-D.irichlet)  boundary  conditions  in  a  uniform  manner  will  be 
important.  Our  study  will  include  the  problems  of  mesh  generation  and 
the  solution  of  the  sparse  systems  of  finite  element  equations,  as  well 
as  the  actual  generation  of  those  equations. 
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We  will  also  be  interested  in  the  performance  of  finite  element 
methods  (for  our  chosen  class  of  problems) .  We  will  evaluate  them  by 
comparing  numerical  solutions  to  selected  problems  obtained  by  different 
numerical  methods,  including  among  others,  finite  difference  methods.  We 
will  also  compare  different  finite  element  methods;  that  is,  finite 
element  methods  using  different  bases.  Our  results  should  offer  some 
evidence  as  to  which  numerical  technique  is  best,  although  the  question 
of  what  we  mean  by  "best"  is  indeed  very  complex.  Obviously,  if  we  choose 
our  problems  carefully,  almost  any  method  can  be  made  to  look  best.  If 
we  have  a  specific  problem  that  must  be  solved  many  times,  then  it  may 
very  well  be  worthwhile  to  find  the  best  method  for  that  particular 
problem  (even  though  the  method  is  applicable  to  a  rather  small  class 
of  problems,  and  therefore  unsuitable  for  the  purposes  we  have  set  down 
above) .  For  our  purposes,  the  following  questions  will  be  of  more  or 
less  equal  importance  in  evaluating  and  comparing  numerical  methods: 

(a)  What  accuracy  is  achieved  for  a  given  amount  of  computation? 

(b)  What  storage  is  required? 

(c)  Does  the  method  rely  on  domain  shape?  (For  example,  does  it 
only  apply  for  square  domains,  or  rectangular  polygons?) 

(d)  Does  the  method  utilize  a  special  technique  which  requires  some 
information  known  only  to  an  expert  in  the  field?  If  so,  can 
the  technique  be  integrated  into  the  program  so  that  the  ama'eur 
user  can  use  the  technique  unassisted? 
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(e)  How  generally  applicable  is  the  method?  For  example,  must  the 
coefficients  of  the  differential  operator  be  constants  or  be 
restricted  in  some  other  way?  Are  normal  derivative  or  mixed 
boundary  conditions  easily  handled? 

Obviously,  whether  some  or  all  of  these  considerations  are  important 
depends  upon  individual  needs  and  circumstances,  but  from  our  viewpoint 
of  designing  a  general  purpose  program,  we  would  like  to  use  a  method 
which  yields  a  satisfactory  response  to  all  of  them.  Our  aim  is  to  show 
that  finite  element  methods  are  very  strong  candidates. 

Many  of  the  comparisons  of  numerical  methods  which  appear  in  the 
literature  are  made  in  the  context  of  solving  a  specific  problem,  and 
the  comparisons  are  often  made  on  the  basis  of  (a)  and  perhaps  (b),  with 
much  less  emphasis  (perhaps  only  acknowledgement)  of  differences  in  (c), 
(d),  and  (e) .  Given  the  high  cost  of  program  development  and  the 
diminishing  cost  of  computing  power  and  hardware,  we  feel  these  latter 
considerations  deserve  more  attention  than  they  normally  receive.  Our 
emphasis  in  this  thesis  will  be  on  a  method^  general  utility  rather  than 
on  its  ability  to  solve  any  particular  problem  "better"  than  it  has 
been  solved  before.  Hence,  many  of  our  conclusions  will  be  of  a 
qualitative  rather  than  quantitative  nature.  Nevertheless,  we  feel 
such  results  are  important  and  useful.  A  review  of  the  thesis  and  a 
summary  of  our  results  are  found  in  Section  1.5. 

Throughout  the  text  "section  nrn2  "  will  mean  section  n2  of 

chapter  n.^  .  Equations,  figures  and  tables  in  section  ng  will  be  numbered 

(n2'3-)>  (n2,2)>  •  •  •  >  and  references  in  chapter  nn  to  figure  (n0.n,)  also 

x  2  3 

in  chapter  n1  will  just  be  n^n^;  references  to  figure  (n^n^)  appearing 

in  another  chapter  would  be  written  (n  .np.n.,) . 

^  ✓ 
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The  Variational  Principle  and  a  Brief  Discussion  of  Ritz  Methods 


For  many  boundary  value  problems  of  even  order  it  is  possible  to 
construct  an  integral  I[v]  which  can  be  formed  for  all  functions 
lying  in  a  certain  class  V  and  which  takes  on  a  minimum  for  precisely 
the  function  ue.V  which  satisfies  the  boundary  value  problem.  This  is 
called  the  variational  formulation  of  the  problem,  and  usually  corresponds 
to  minimizing  the  energy  of  a  physical  system.  The  differential  equation 
of  the  boundary  value  problem  is  the  Euler-Lagrange  equation  obtained  by 
imposing  the  condition  that  the  first  variation  of  T[v]  vanish  [02] . 

For  example,  let  R  be  a  two  dimensional  region  bounded  by  a 
piecewise  smooth  curve  c)R  .  Consider  the  problem 


R 


where  veV  ,  the  class  of  functions  in  C(R  U  &R)  witn  first  derivatives 
in  L^(R)  and  satisfying  (2.2)  [C2]. 

The  Ritz  procedure  for  finding  an  approximate  solution  to  (2.1) -(2.2) 
is  as  follows:  Let  /cV  be  a  finite  dimensional  subspace  of  V 
spanned  by  the  functions  ^  ,  i  =  1,2, ...,N  .  Our  aim  is  tc  obtain 


1+ 


an  approximation  vN  to  u  by  minimizing  l[v]  for  veV^  .  Writing 
vN  in  the  form 


(2.4) 


N 


N 

-  X.  “A 

k-o. 


where  0^  ,  k  =  1,2,  .  ..,R  are  real  numbers  to  be  determined,  we  use 
(2.4)  in  (2.3)  to  obtain  the  quadratic  function 


=  Q?  Act  +  20?  b  , 
where  aT  =  (a^,  ...,<y  , 


(2.6)  A.  .  =  (T  (ty.  >Jr .  +  ty.  V.  )  dxdy 

v  '  10  JJ  'Yi*x  o,x  i,y  o>y' 

R 

and 


(2.7)  bi  =  JJ  f'lr.dxdy 


R 


Using  the  important  fact  that  A  is  symmetric,  we  obtain  the  system  of 

equations  ACC  =  -b  which  determines  the  coefficients  a  in  (2.4) 

*  U  * 

yielding  the  minimizing  v  e  V  .  Under  appropriate  hypotm-ses,  v  u 

as  N  -♦  oo  [K3].  The  importance  of  the  finite  element  method  is  that 

it  allows  us  to  construct  ijn’s  which  satisfy  these  hypotheses  and  which 

also  have  attractive  computational  properties.  This  is  taken  up  in  the 

next  section. 
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Note  that  for  our  chosen  class  of  problems,  it  will  always  be 
possible  to  arrange  that  the  coefficient  matrix  of  the  linear  system 
we  must  solve  is  symmetric,  since  oFaq:  =  ^  qFaq:+^  otA^a  =gQ^(A  +  A^)a 
oFact,  where  A  is  obviously  symmetric . 
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3.  Essential  Characteristics  of  Finite  Element  Methods 

The  term  "finite  element"  appears  to  have  originated  in  the  early 
1950's  with  structural  engineers  who  regarded  conventional  structures 
as  composed  of  a  number  of  separate  elements  interconnected  at  node  points . 
The  concept  was  extended  to  continuous  problems  such  as  plate  bending  and 
steady-state  temperature  distribution,  where  the  elements  are  merely 
subdivisions  of  the  domain  of  the  problem  with  adjacent  elements  having 
a  common  vertex  or  common  side.  The  most  common  element  shapes  are 
triangles  and  rectangles.  Our  attention  will  be  devoted  almost  exclusively 
to  triangular  elements  in  this  thesis,  primarily  because  odd  shaped 
domains  can  be  more  easily  divided  into  triangles  than  rectangles. 

Finite  element  methods  are  Kits  methods  which  use  basis  functions 
having  small  support;  that  is,  Ritz  methods  which  make  use  of  a  so-called 
"local  basis".  In  Chapter  3  we  will  discuss  the  actual  procedure.  At 
this  point  we  simply  observe  that  finite  element  methods  make  use  of 
trial  functions  vN  (see  Section  1.2)  having  the  form 

(5-1)  =  L  “A  ’ 

where 

(a)  vN  is  a  piecewise  polynomial  on  R  U  dR  . 

(b)  vN  is  a  polynomial  on  each  element. 

(c)  each  basis  function  is  associated  with  a  node  point  lying  on  a 
vertex,  side,  or  interior  of  an  element,  and  is  non-zero  only  on 
elements  containing  the  node.  This  property  is  depicted  below: 


7 


Figure  3-1-a  Figure  3.1-b 

Support  of  \|r  associated  Support  of  associated 

K  K 

with  a  corner  (vertex)  node.  with  a  side  node. 


Figure  3-1-c 

Support  of  ^ 
associated  with  an 
interior  node. 


More  than  one  basis  function  may  be  associated  with  a  particular  node, 

and  because  of  the  way  the  basis  functions  are  chosen,  the  nodal 

parameters  associated  with  each  ^  turn  out  to  be  the  value  or 

N 

the  value  of  a  derivative  of  v  at  the  corresponding  node  point.  The 

choice  of  these  nodal  parameters  is  done  on  the  basis  of  (1)  the  number 

N  /  s 

of  degrees  of  freedom  v  has  on  each  element  and  (2)  the  continuity 
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H 

requirements  of  v  .  Indeed,  a  common  practice  is  not  to  consider  the 
basis  functions,  but  instead,  to  choose  the  parameters  so  as  to  uniquely 
characterize  the  polynomial  on  each  element  and  at  the  same  time  to 
attain  a  desired  degree  of  continuity  across  interelement  boundaries. 

N 

For  example,  consider  piecewise  linear  polynomials,  for  which  v 

N 

is  a  linear  function  on  each  triangle.  The  trial  solution  v  can  be 
uniquely  characterized  by  its  value  at  any  three  non-collinear  points.  By 
choosing  these  three  parameters  at  the  vertices,  we  can  guarantee  continuity 
along  interelement  boundaries.  We  would  indicate  this  subspace  by  the 
element  stencil 


v 


,  or  just 


It  is  fairly  easy  to  see  that  this  amounts  to  using  a  "pyramid 
function"  at  each  vertex  node,  as  depicted  below: 


9 


Figure  3 >2  Pyramid  Function  ty.  associated  with  node  k  . 

Some  other  common  stencils  associated  respectively  with  quadratic  and 
cubic  polynomials  are 


Quadratic  Cubic. 

Note  that  in  the  last  example,  three  basis  functions  will  be  associated 
with  each  corner  node,  and  the  function  associated  with  the  interior  node 
will  be  non-cero  only  on  the  triangle  containing  the  node.  A  (non-exhaust ive) 
list  of  stencils  can  be  found  in  Appendix  A. 
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Piecewise  polynomials  derived  in  this  way  are  sometimes  referred 
to  as  interpolation  polynomials,  since  they  are  characterized  by  the 
values  (and  perhaps  derivative  values)  that  they  assume  at  the  node 
values.  Note,  however,  that  in  our  application  the  piecewise  polynomial 
will  not  (usually)  interpolate  the  solution  of  our  boundary  value  problem. 

We  will  not  consider  the  important  mathematical  question  of  when 
/  \  N 

(and  how  fast)  v  -•  u  as  N  -*  co  .  We  will  simply  make  some  observations 
and  refer  to  relevant  sources  in  the  literature: 

(i)  Because  each  basis  function  vanishes  over  most  of  the  domain,  the 
linear  system  that  is  generated  is  sparse.  Strang  [S5]  emphasizes  this 
by  stating  that  "by  a  suitable  choice  of  the  trial  functions  . . .  the 
Galerkin  equations . . .  turn  out  to  be  difference  equations" .  Whether  we 
call  them  finite  element  or  finite  difference  equations  is  largely  a  matter 
of  taste;  we  prefer  the  former,  and  reserve  the  term  "finite  difference" 
for  those  methods  based  on  divided  difference  approximations.  For 
polynomial  basis  functions  of  low  degree  the  two  approaches  sometimes  yield 
the  same  equations.  Our  distinction  is  made  on  the  method  of  derivation 
rather  than  the  end  result. 

(ii)  The  value  of  finite  element  methods  will  obviously  depend  upon 
how  well  the  trial  functions  can  approximate  the  true  solution  of  our 
boundary  value  problem.  This  problem  has  been  studied  for  general 
elliptic  operators  and  tensor  product  approximating  spaces  in  [b8,s6,S1] 
and  in  references  contained  therein.  We  will  briefly  discuss  tt e 
practical  advantages  and  disadvantages  of  these  spaces  in  Section  1.4. 
Bramble  and  Zlamal  [B12J,  Zlamal  [Z4,Z5],  ^ene^ck  [Zl],  Goel  [G2], 
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and  others  have  proved  convergence  of  the  method  and  presented  bounds 
for  various  elliptic  operators  and  piecewise  polynomials  on  triangles. 
Qualitatively,  their  results  say  that  if  the  approximating  subspace  is 
admissible,  and  the  true  solution  u  is  "smooth  enough",  then  an 
increase  in  d  (the  degree  of  the  piecewise  polynomial)  induces  an 
equivalent  decrease  in  the  error  bound.  That  is,  the  error  bounds  are 
of  the  form 

llu-v\  <  g  hd+1“q  ilulld+1  > 

where  |j u  f  =  I  PMf  2  ,  i  =  (i^iJ  ,  |i|  =  i^±2  >  and 

|i|<*  L 

i  u 

D  u  =  — t - —  .  Here  h  is  the  maximum  length  of  any  triangle  side  in 

axV2 

the  mesh,  and  <3  is  a  constant  which  depends  upon  uhe  sharpest  angle  in 
the  mesh  and  the  polynomial  basis  (element)  being  employed.  For  specific 
details,  the  reader  is  referred  to  the  papers  mentioned  above. 

(iii)  The  condition  of  the  finite  element  linear  system  which  we 
obtain  will  obviously  depend  upon  our  choice  of  tyrs  .  Indeed,  one  of 
the  problems  of  using  the  Ritz  technique  has  been  the  numerical 
instability  of  the  discrete  problem,  caused  by  choosing  almost  linearly 
dependent  trial  functions.  Intuitively,  we  would  anticipate  that  such 
problems  would  be  much  less  troublesome  for  the  finite  element  method 
because  the  majority  of  the  ty's  will  be  orthogonal  (by  virtue  of  having 
disjoint  support).  Strang  and  Fix  [S6]  study  this  problem  in  depth  for 
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uniform  meshes  by  examining  the  condition  number  H(G)  =  ||Gn||  )jGn  || 
of  the  Gram  matrix,  whose  entries  are  the  inner  products  of  the  basis 
elements  ^  .  They  conclude  that  all  the  usual  piecewise  polynomial 
trial  functions  yield  a  stable  basis,  where  stability  means  that  H(G) 
remains  bounded  as  h  -♦  0  .  They  show  that  the  condition  of  the 
coefficient  matrix  A  obtained  from  the  application  of  the  finite 
element  method  (using  a  stable  basis)  to  a  uniformly  elliptic  operator 
of  order  2m  is  of  the  form  Ch  ,  where  h  is  the  mesh  width  and  C, 
depends  on  the  choice  of  the  basis.  This  result  is  of  practical 
significance;  for  a  given  problem  it  says  that  as  long  as  we  use  a  stable 
basis,  the  condition  of  the  coefficient  matrix  does  not  deteriorate  as 
we  increase  the  degree  of  our  polynomials.  Note  that  these  results  only 
apply  for  uniform  meshes,  and  it  is  not  known  how  detrimental  severe 
grading  of  the  mesh  may  be  to  the  condition  of  the  matrix  A  . 
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4.  Tensor  Product  Spaces 


Suppose  =  [0,1]  is  divided  up  into  a  uniform  mesh  with  grid 
points  (ih,  i  =  0,1,2,  ...,n)  and  assume  we  have  a  basis 
('l'.  (x)  ,  i  =  0,1,2,  ,..,n)  on  [0,1]  ,  where  each  is  non-zero  on 

the  interval  [i-ph, i+ph]  ,  with  p  small.  Now  consider  the  domain 
Dg  =  [0,1]  x  [0,1]  ,  covered  by  a  square  mesh  with  grid  points 
((ih, jh)  ,  i  =  0, 1,2, . .  .,n,  j  =  0,1,2,  .. .,n  )  .  To  construct  a  tensor 
product  basis  on  D0  we  form  the  functions  =  ^(x)'(f.(y)  , 

i,j  =  0,1,2, ...,n  .  The  trial  function  v  ,  where  N  =  (n+1)  ,  is 

then  given  by 


(4.1) 


N 


n 


* 


a.  .'!f. . 
ij  ij 


The  main  advantage  to  this  approach  is  that  it  is  possible  to 
obtain  a  relatively  smooth  approximation  with  only  moderate  N  ,  since 
if  'iheC^D^J  ,  i  =  0,1,  ...,n  ,  then  eC^tDg]  >  i>j  =  0,1,  ...,n  . 

This  is  often  done  by  taking  a  spline  basis  for  the  $.*s  .  For 

N  2  r 

example,  we  can  have  v  (Dg)  by  using  the  cubic  spline  basis  LSI].  For 

a  specific  degree  of  smoothness,  the  number  of  parameters  (unknowns) 

in  the  problem  increases  ss  n  for  k  dimensions.  Of  course,  the 

reason  for  these  properties  is  the  fact  that  the  interelement  boundaries 

[which  are  h  by  h  squares]  are  constrained  to  lie  along  coordinate 

axes,  and  this  brings  us  to  the  major  disadvantage  cf  this  method  of 

basis  construction. 

Because  our  elements  are  squares  [or  perhaps  rectangles  --  it  is 
easy  to  scale  the  basis  functions],  our  domain  must  be  restricted  to  be 


the  union  of  rectangles*  Furthermore,  it  is  virtually  impossible  to 
grade  (i.e.,  subdivide)  the  net  "locally".  If  a  fine  mesh  is  desired 
in  a  region  of  the  danain,  then  it  must  be  made  fine  in  an  interval  in 
each  coordinate,  even  though  we  only  desire  the  fine  mesh  in  the 
intersection  of  these  intervals.  It  is  fairly  easy  to  conceive  of 
realistic  problems  which  would  force  the  grid  to  be  almost  uniformly 
fine. 

However,  there  is  seme  reason  for  optimism  regarding  this  geometrical 
problem;  Bramble  and  Schatz  [Bill  and  Babuska  [Bl]  have  analyzed  some 
methods  that  do  not  require  that  the  basis  functions  satisfy  any  boundary 
conditions.  The  basic  idea  is  to  imbed  the  given  domain  R  with 
boundary  SR  in  a  larger  domain  E'  oB  ,  with  the  basis  functions 
satisfying  homogeneous  boundary  conditions  on  the  boundary  of  R’  . 

A  boundary  integral  on  SR  scaled  by  h~7  ,  0  <  7  <  ro  (where  h  is 

the  mesh  width),  is  added  to  a  least  squares  formulation  of  the  problem. 
The  boundary  integral  is  designed  so  that  its  minimum  occurs  when  the 
approximate  solution  satisfies  the  boundary  conditions  on  SR  .  As  would 
be  expected,  their  error  estimates  depend  upon  the  smoothness  of  the 
boundary  data  and  the  solution.  They  show  that  7  =  5  is  optimal  in 
some  situations. 

We  have  not  pursued  this  avenue  of  investigation  in  this  thesis 
because  the  approach  we  use  to  generate  uur  basis  functions  allows  us 
to  fairly  easily  satisfy  boundary  conditions. 
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5-  Review  of  the  Thesis  and  Summary  of  Conclusions 

As  our  title  implies,  the  emphasis  in  this  thesis  is  on  implementation, 
and  such  a  study  leads  to  interesting  practical  problems  which  are  seldom 
discussed  in  papers  on  finite  element  methods.  Engineering  articles  on  finite 
element  methods  are  often  devoted  to  discussing  the  virtues  of  particular 
elements  for  solving  specific  problems.  Mathematical  papers,  on  the  other 
hand,  are  usually  concerned  primarily  with  rates  of  convergence  of  various 
finite  element  spaces.  We  feel  our  work  lies  between  these  two  extremes; 
we  are  concerned  with  the  actual  implementation  of  finite  element  methods  and 
how  they  compare  in  practice  with  other  methods  for  solving  elliptic  boundary 
value  problems. 

In  Chapter  2  we  examine  the  problem  of  generation  and  storage  of 
two-dimensional  triangular  meshes.  We  begin  by  reviewing  previous  work 
on  automatic  mesh  generation.  We  then  present  a  semi-automatic  procedure 
for  triangulation  of  a  domain.  The  method  requires  the  user  to  provide  a 
gross  triangulation  of  the  domain,  reflecting  any  desired  grading.  The 
mesh  is  then  refined  by  any  specified  factor  by  the  program.  We  feel 
this  compromise  solution,  although  not  particularly  elegant,  is  important 
for  several  reasons:  (a)  the  required  input  for  most  domains  is  small, 

(b)  the  method  can  easily  be  adapted  for  use  with  graphical  display 
equipment),  (c)  curved  boundaries  can  be  incorporated  easily, 

(d)  the  net  can  be  graded  under  control  of  the  user,  and  ( e)  inter-element 
boundaries  can  be  forced  to  lie  in  specific  positions  (along  lines  of 
material  discontinuity,  for  example) . 

Chapter  2  also  contains  a  description  of  a  completely 
automatic  domain  triangulator.  Although  the  algorithm  cannot  be  considered 
a  finished  product,  we  have  included  it  because  we  feel  it  represents  a 
promising  approach  to  automatic  triangulation.  It  is  applicable  to 
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arbitrary  simply  connected  domains  and  is  designed  to  produce  graded 
nets  where  appropriate.  Some  examples  of  meshes  produced  by  the  algorithm 
are  presented  and  some  further  areas  of  research  are  suggested. 

The  final  section  of  Chapter  2  contains  an  efficient  storage  scheme 
to  represent  arbitrary  triangular  meshes.  Using  this  scheme  along  with 
some  results  obtained  in  Chapter  we  compare  the  storage  required  for 
the  mesh  to  the  number  of  non-zero  elements  in  the  coefficient  matrix. 

We  show  that  except  for  piecewise  linear  polynomials,  the  storage 
required  for  the  mesh  is  small  compared  to  that  required  for  just  the 
non-zero  elements  in  the  coefficient  matrix.  We  conclude  that  the 
mesh  storage  will  seldom  he  an  important  factor  in  overall 
storage  requirements  in  the  application  of  finite  element  methods. 

Chapter  3  deals  in  detail  with  the  actual  generation  of  the  finite 
element  equations.  The  process  consists  of  two  phases.  The  first  is  the 
computation  of  the  stiffness  matrices  which  express  our  integral  over 
each  element  in  terms  of  the  nodal  parameters  used  to  characterize  it. 

The  second  phase  consists  of  assembling  these  matrices  into  a  single 
large  system  and  eliminating  those  parameters  whose  values  are  already 
specified  by  boundary  conditions .  For  the  first  phase  we  describe 
one  method  for  generating  coefficients  of  the  equations  on  each  triangle. 

We  justify  our  use  of  the  approach  over  others  by  demonstrating  where 
much  of  the  computation  and  manipulation  of  the  basis  functions  can  be 
carried  out  symbolically,  thus  avoiding  use  of  numerical  (or  hand) 
integration  and/or  differentiation.  Section  J>.b  deals  with  the  assembly 
of  the  equations.  Boundary  conditions  which  involve  derivative 
parameters  cause  annoying  implementation  problems  if  the  boundary  is  not 
parallel  to  the  x  or  y  axis,  since  relations  between  several  parameters 


must  sometimes  be  satisfied.  We  discuss  two  alternate  methods  of 


handling  these  problems  and  compare  the  implementation  of  each. 

A  study  of  sparse  matrix  methods  is  the  subject  of  Chapter  4, 
with  particular  emphasis  on  the  type  of  matrices  arising  from  finite 
element  methods.  We  introduce  the  concept  of  the  profile  of  a  matrix, 
and  distinguish  between  graph  methods,  profile  methods  and  band  methods. 
We  present  arguments  and  experimental  evidence  supporting  the  use  of 
profile  methods. 

In  Section  4.5  we  compare  several  ordering  algorithms  applied  to 
matrices  arising  from  different  finite  element  bases.  These  experiments 
show  the  following:  (a)  profile  methods  can  be  significantly  better 
than  band  methods,  in  terms  of  both  storage  requirements  and  operation 
counts;  (in  Sections  4.1  and  4.2  we  show  that  they  will  never  be  worse 
than  band  methods.)  (b)  the  "reverse  Cuthill-McKee"  ordering  (our 
terminology),  which  we  have  discovered  compares  very  favorably  with 
other  methods  tested  ;  (c)  comparison  of  times  required  to  produce  the 

reverse  Cuthill-McKee  ordering  with  some  of  the  times  required  for  the 
entire  finite  element  solution  (reported  in  Chapter  5)  suggests  that  the 
use  of  the  algorithm  is  relatively  Inexpensive.  We  feel  that  such 
information  is  extremely  important.  It  is  often  contended  by  experienced 
users  that  automatic  ordering  is  unnecessary  because  they  can  produce 
an  ordering  empirically  that  is  close  to  optimal.  This  may  very  well  be 
true,  but  not  all  users  are  experienced,  and  more  important,  one  must 
still  devise  a  way  of  communicating  the  desired  ordering  to  the  computer. 
We  have  shown  that  this  largely  clerical  process  can  best  be 
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left  to  the  computer.  The  code  for  doing  the  ordering  appears  as  part 
of  phase.  1  in ; Appendix  B. 

Also  in  Chapter  4,  we  derive  formulas  for  the  density  of  finite 
element  matrices  for  general  elements  and  arbitrary  triangular  and 
quadrilateral  meshes  with  holes.  Such  results  are  important  in  managing 
storage,  since  we  can  allocate  storage  for  the  matrix  as  soon  as  the 
mesh  and  element  to  be  used  are  known. 

Chapter  5  contains  results  of  several  numerical  experiments. 

The  chapter  contains  numerical  solutions  to  the  L-shaped  membrane 
eigenvalue  problem,  rhombical  membrane  eigenvalue  problems,  and  a 
hollow  square  Diric'nlet  problem.  Our  comparisons  are 
between  different  finite  element  methods  as  well  as  between  finite 
element  methods  and  their  competitors.  These  experiments  showed  the 
following: 

(a)  Efficiency  in  general  increased  with  increasing  degree  of  piecewise 
polynomial.  This  was  true  in  all  three  examples,  and  because  the 
solutions  ranged  from  very  smooth  ones  to  ones  with  singularities 

in  their  first  derivatives,  we  feel  this  information  is  significant. 

(b)  Finite  difference  methods  compared  rather  unfavorably  with  our 
finite  element  solutions.  Even  for  the  problem  where  special  fast 
direct  methods  for  solving  the  difference  equations  could  be 
utilized  [B15,G1],  our  finite  element  solutions  appeared  preferable. 

(c)  Several  methods  for  finding  eigenvalues  yielded  more  accurate  numbers 
than  finite  element  methods  (involving  roughly  the  same  cost),  and 
also  produced  bounds.  However,  these  methods  use  techniques  which 
utilize  a  special  feature  of  the  equation  or  of  the  domain,  and  are 


difficult  to  implement  in  a  general  code.  Again  we  emphasize  that 
we  are  not  saying  these  methods  are  inferior;  we  are  simply 
saying  that  they  are  less  suitable  than  finite  element  methods  as 
the  core  of  a  general  boundary  value  problem  solver. 

Appendix  A  contains  a  list  of  sane  typical  elements.  Some  of  these 
are  referred  to  throughout  the  text. 

Appendix  B  contains  a  listing  of  the  Fortran  code  we  have  developed 
for  solving  a  class  of  linear  elliptic  boundary  value  problems.  We  have 
segmented  the  code  into  modules,  each  one  designed  to  carry  out  a  specific 
task  or  set  of  tasks.  The  modules  execute  in  sequence,  with  information 
passing  from  one  to  the  next  via  external  storage  media  which  can  be  disk, 
dram  or  tape.  Our  reasons  for  segmenting  our  code  and  attempting  to 
keep  each  segment  itself  modular  are  (a)  to  ease  maintenance  and/or 
modification  of  the  code,  (b)  to  allow  the  program  to  be  run  on 
smaller  machines  than  the  one  we  used,  and  (c)  to  facilitate  documentation 
and  understanding  of  the  code  by  localizing  specific  functions.  Specific 
details  of  the  functions  of  each  segment  are  found  in  comments  in  the 


code  itself. 


CHAPTER  2 


GENERATION  AND  STORAGE  OF  TWO-DIMENSIONAL  TRIANGULAR  MESHES 

1.  Introduction 

The  first  step  in  most  numerical  methods  for  solving  partial 
differential  equation  problems  is  that  of  discretizing  the  domain  in 
question.  In  our  case,  the  problem  consists  of  dividing  our  given 
domain  R  into  disjoint  triangles  whose  union  is  R  U  SR  ,  with  adjacent 
tirangles  having  a  common  side.  If  R  has  curved  boundaries,  we  will 
admit  "curvilinear"  triangles  having  one  curved  side  in  the  triangulation 
near  the  boundary.  Figure  1.1  is  an  example  of  such  a  triangulation. 


Manual  genera. xon  of  a  triangulation  of  R  is  an  extremely  tedious 
job.  A  completely  automatic  triangulation  procedure,  on  the  other  hand, 
while  obviously  desirable,'  is  complicated  and  difficult  to  implement  with 
any  degree  of  flexibility.  In  Section  2  we  review  the  literature  on 
two-dimensional  triangulation  and  in  Sections  5  and  4  we  present  two 
new  methods  for  triangulation  of  two  dimensional  domains.  The  method 
described  in  Section  3  is  a  semi-automatic  scheme,  while  the  one  in 
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Section  4  is  almost  completely  automatic.  Section  5  contains  a 
description  of  an  efficient  scheme  for  storing  a  representation  of  an 
arbitrary  two-dimensional  triangular  mesh. 

Once  we  have  a  suitable  triangulation  of  the  domain,  we  are  faced 
with  the  problem  of  numbering  the  nodes  (more  precisely,  the  parameters 
associated  with  the  nodes)  in  order  to  reduce  the  computation  and/or 
storage  requirements  for  the  solution  of  the  algebraic  system.  Although 
it  is  possible  to  defer  any  ordering  (and  then  actually  order  the 
parameters  rather  than  the  nodes),  the  problem  will  be  considerably 
larger  if  each  node  has  more  than  one  parameter  associated  with  it. 

Since  all  the  parameters  associated  with  a  particular  node  are  connected 
in  the  same  way  to  other  parameters  as  well  as  all  being  connected  to 
each  other,  little  is  lost  by  ordering  the  nodes.  Many  good  ordering 
algorithms  require  work  proportional  to  the  number  of  nodes  multiplied 
by  the  square  of  the  number  of  neighbours  each  node  has,  so  substantial 
savings  can  be  achieved  by  ordering  at  this  stage.  We  defer  discussion 
of  these  algorithms  and  the  criteria  used  to  reduce  storage  and  computational 
requirements  until  Chapter  4,  although  again  we  emphasize  that  they  should 
be  applied  at  this  stage. 
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2.  Review  of  Previous  Work  on  Mesh  Generation 

Mesh  generation  is  a  difficult  part  of  a  boundary  value 
problem-solver  to  automate.,  and  even  the  most  generally  applicable  programs 
require  substantial  human  assistance,  especially  in  describing  the  boundary  c® 
In  most  boundary  value  problems  the  solution  is  not  uniform  in  character  over 
the  domain.  Often  it  is  fairly  smooth  over  most  of  E,  and  varies  rapidly 
only  over  a  small  part  of  the  domain,  perhaps  near  a  corner.  For  this  reason 
it  should  be  convenient,  if  desired,  to  indicate  areas  of  the  domain  R  where 
the  grid  can  be  coarse  and  areas  where  it  should  be  fine.  This  grading 
capability  could  provide  substantial  saving.:  in  storage  requirements  and 
computer  time. 

For  practical  reasons  finite  difference  programs  have  tended  net  to 
provide  for  the  grading  of  nets.  This  is  due  largely  to  the  ease  with  which 
one  can  store  a  regular  rectangular  net  in  a  conventional  two-dimensional 
array  and  the  severe  storage  management  problems  which  immediately  result 
when  one  departs  from  such  a  scheme.  In  the  regular  case,  actual  coordinates 
do  not  even  have  to  be  stored,  which  is  a  persuasive  argument  for  using  a 
regular  net.  Also,  truncation  error  bounds  for  some  difference  operators 
are  much  better  for  regular  nets,  and  the  determination  of  the  coefficients 
for  the  difference  operator  is  usually  much  easier  (a  prime  consideration  if 
an  iterative  scheme  is  being  used  and  the  coefficients  are  being  computed  each 
time  they  are  needed) .  Thus,  finite  difference  programs  usually  make  use  of 
uniform  meshes,  or  meshes  which  are  uniform  in  various  parts  of  the  region. 
Boundary  points  that  result  when  Sr  intersects  the  mesh  at  a  point  other 
than  a  node  point  are  treated  by  using  well-known  interpolation  formulas. 

(These  special  boundary  points  may  cause  storage  problems  even  when  the 
simple  two-dimensional  array  storage  method  is  used;  see  Forsythe  and  Wasow 
1?5,  pp.  361-63]  for  a  discussion.)  If  the  boundary  is  curved,  it  may  be 
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rather  awkward  to  find  the  correct  formula  to  preserve  the  order  of 
accuracy.  In  this  context,  the  actual  mesh  generation  is  not  a  difficult 
problem.  The  problems  arise  where  the  boundary  (which  can  have  more  or 
less  arbitrary  shape)  intersects  the  regular  mesh.  Cryer  [G3 ]  treats  this 
entire  problem  in  considerable  detail,  and  we  will  not  discuss  it  further 
here. 

The  mesh  generation  question  with  regard  to  finite  elements  has  a 

somewhat  different  flavor.  In  this  case,  grading  the  mesh  is  essentially 

without  cost  provided  we  are  going  to  store  the  node  coordinates  anyway. 

It  is  often  stated  that  irregular  nets  are  expensive  to  use  because  the 

coordinates  of  each  net  point  must  be  stored,  and  for  finite  difference 

methods  this  objection  is  valid.  [For  example,  suppose  we  are  solving 

Laplace's  equation  on  the  unit  square.  Using  a  uniform  n  by  n  grid,  the 

2 

required  storage  is  about  n  ,  assuming  we  are  going  to  solve  the  equations 

using  SOR.  By  comparison,  if  our  mesh  is  irregular,  we  must  remember  the 

2  2 
coordinates  of  each  of  the  n  nodes.  Then  we  would  need  a  total  of  3n 

words  of  storage,  and  if  we  want  to  avoid  recalculation  of  the  coefficients 

of  the  difference  operator  at  each  iteration  (which  will  no  longer  all  be 

the  same),  we  will  need  8n  words  of  storage.]  However,  for  finite  element 

methods,  the  number  of  node  points  will  ordinarily  be  considerably  fewer  than 

the  number  of  parameters  since  each  node  will  usually  have  derivative  as 

well  as  function- value  parameters  associated  with  it.  As  the  degree  of 

the  basis  functions  increases,  the  storage  required  for  the  nodes  quickly 

becomes  small  compared  with  that  required  for  the  coefficient  matrix.  This 

point  is  taken  up  in  Section  5  of  this  chapter. 
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We  will  now  review  some  methods  appearing  in  the  literature  which  have 
dealt  with  this  triangulation  problem. 

Cheung  and  Pedro  [C4]  have  written  a  program  that  generates  a 
triangulation  using  the  following  general  scheme.  The  domain  is  divided 
by  one  family  of  straight  lines  (which  do  not  intersect  in  the  domain  but 
are  not  necessarily  parallel)  or  arcs  of  circles  (not  necessarily  concentric) 
or  both.  Each  line  is  further  divided  into  a  number  of  divisions  to 
yield  node  points.  The  node  points  on  adjacent  lines  are  then  joined  in  a 
zig-zag  manner  to  form  triangles.  The  number  of  divisions  in  adjacent 
lines  can  only  differ  by  one  —  a  hindrance  if  pronounced  grading  of  the 
net  is  desired.  This  restriction  can  also  lead  to  triangles  with  very 
sharp  angles. 

Wo  attempt  appears  to  be  made  to  automatically  achieve  a  nodal 
numbering  yielding  a  small  bandwidth;  instead  manual  "supervision"  has  to 
be  exercised  at  various  stages.  The  only  attempt  to  avoid  or  remove  small 
angles  is  done  when  forming  two  triangles  from  a  quadrilateral;  the  lengths 
of  the  diagonals  are  computed  and  the  shorter  is  used  to  form  the  triangles. 
(This  can  be  disastrous;  consider  the  quadrilateral  (-1,0),  (0,-2),  (1,0), 
(0, e)  where  e  is  positive  but  very  small.) 

Frederick,  Wong,  and  Edge  [F7]  present  a  two-stage,  semi-automatic 
method  for  triangulating  a  two-dimensional  domain.  The  first  stage  consists 
of  manually  plotting  the  boundary  of  the  domain  and  the  node  points  (in  the 
order  designed  to  minimize  or  at  least  reduce  the  bandwidth  of  the  resulting 
linear  system)  on  an  electromagnetic  graph-tracing  table.  The  coordinates 


of  the  points  are  automatically  punched  on  cards  which  then  serve  as 
input  to  a  computer  program  that  generates  the  triangles.  There  are  a 
number  of  potential  drawbacks  to  this  approach.  The  first  is  that  for 
odd-shaped  domains  it  is  surprisingly  difficult  to  number  the  nodes 
empirically  so  as  to  achieve  a  small  bandwidth,  especially  if  the  net  is 
graded  rather  severely.  As  we  snail  see  in  Chapter  b,  bandwidth  is  not 
necessarily  a  very  good  criterion  anyway,  and  to  number  the  nodes  empirically 
to  achieve  other  (more  satisfactory)  criteria  can  be  even  more  difficult. 
Secondly,  without  actually  drawing  in  the  triangles  as  you  go  along  it  is 
hard  to  decide  where  the  next  node  would  be  placed.  If  the  triangles  are 
to  be  drawn,  very  little  more  manual  effort  would  be  necessary  to  tabulate 
their  respective  nodes,  thus  eliminating  the  computer  program  completely. 

As  the  authors  point  out,  however,  the  computer-based  part  of  the  procedure 
eliminates  the  clerical  errors  which  would  inevitably  result  from  tabulation 
by  humans.  Although  it  is  unfortunate  that  special-purpose  equipment  is 
required,  the  basic  procedure  is  very  appealing.  It  is  easy  to  see  how 
the  same  basic  idea  could  be  implemented  in  an  interactive  way  by  using  a 
cathode  ray  display  with  a  light  pen.  All  the  above  objections  could  be 
eliminated  if  an  automatic  ordering  scheme  (such  as  one  of  those  discussed 
in  Chapter  b)  were  included  in  the  implementation. 

Barfield  [b4]  proposes  a  method  based  on  a  conformal  mapping  of  the 
boundary  of  a  closed  two-dimensional  region  onto  the  perimeter  of  a 
rectangular  polygon  in  which  is  inscribed  an  orthogonal  rectilinear  grid. 

The  method  consists  essentially  of  finding  the  function  which  conformally 
maps  the  given  domain  R  onto  the  polygon,  and  then  using  the  inverse  of 
the  mapping  so  determined  to  find  the  image  of  the  orthogonal  grid  in  the 
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polygon.  The  method  obviously  generates  rectangles  rather  than  triangles, 
so  that  each  rectangle  would  have  to  be  subdivided  to  obtain  a  triangulation 
of  R  .  While  the  method  is  indeed  very  elegant,  considerable  care  appears 
to  be  necessary  to  avoid  distortion,  and  "long,  slender"  squares  yield  very 
poor  triangles.  Also,  the  work  involved  in  computing  the  mapping  may  be 
substantial. 

Winslow  [W5]  proposes  a  method  of  mesh  generation  which  consists 
essentially  of  solving  an  elliptic  boundary-value  problem  using  finite 
difference  methods.  The  mesh  lines  are  regarded  as  two  intersecting  sets 
of  equipotentials,  each  set  satisfying  Laplace's  equation  in  the  interior 
of  the  given  two-dimensional  domain  R  .  "Boundary  conditions"  are 
determined  by  where  the  lines  are  required  to  intersect  the  boundary  S  . 
Because  of  the  well-known  averaging  feature  of  harmonic  functions,  the 
generated  mesh  varies  smoothly  over  the  entire  domain,  its  relative  grading 
being  determined  by  the  density  of  the  points  of  intersection  on  S  (i.e., 
the  boundary  conditions) .  Triangular  and  quadrilateral  grids  can  be 
generated  using  the  method.  Although  the  examples  reported  are  very  nice, 
they  are  for  an  extremely  simple  domain,  and  Winslow  does  not  discuss  the 
problem  of  how  to  concisely  describe  a  general  domain  to  the  program 
(assuming  that  the  program  has  the  facility  for  handling  one),  and  how  to 
easily  input  the  boundary  conditions  (the  ends  of  the  potential  lines) .  As 
with  most  partial  differential  equation  problems,  the  above  tasks  and  the 
associated  data  management  problems  are  difficult  to  implement  in  general; 
once  done,  the  generation  of  the  equations  and  their  solutions  are  relatively 
straightforward,  even  though  they  may  require  considerable  computer  time. 

He  concedes  that  the  method  does  not  always  work  satisfactorily  near  re¬ 
entrant  corners,  with  node  points  outside  the  domain  sometimes  being 
produced. 
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Reid  and  Turner  [  Rl]  use  the  following  scheme  to  generate  nearly  regular 
meshes.  A  regular  equilateral  triangular  mesh  is  placed  over  the  domain  R 
so  that  dR  is  inside  the  mesh  boundary.  Points  where  triangle  sides 
intersect  the  boundary  are  called  "boundary  points",  and  node  points  of  the 
mesh  closer  than  h/2  to  a  boundary  point  are  moved  to  the  boundary  point 
in  such  a  way  as  to  guarantee  the  monotonicity  of  the  resulting  finite 
element  coefficient  matrix.  [A  matrix  is  said  to  be  monotone  if  it 
is  non-singular  and  all  elements  of  its  inverse  are  non-negative. ]  They 
consider  only  piecewise-linear  polynomials.  The  node  points  and  their 
incident  edges  which  remain  outside  c®  after  the  relocation  process  is 
complete  are  then  discarded,  yielding  a  mesh  on  R  which  is  regular 
except  near  the  boundary.  The  authors'  assumption  appears  to  be  that  c® 
has  no  corners,  and  this  restriction  on  c®  simplifies  the  node  relocation 
considerably.  Corners  in  <3R  must  necessarily  end  up  as  vertices  in  the 
triangulation,  so  the  presence  of  corners  imposes  further  restrictions  on 
the  relocation  of  nodes.  It  seams  clear  that  we  would  want  h  to  be  of 
the  same  order  of  magnitude  as  (or  smaller  than)  the  shortest  arc  in  dR 
in  order  to  avoid  generating  triangles  with  sharp  angles.  Such  a 
requirement  could  force  the  mesh  to  be  finer  than  otherwise  necessary.  This 
scheme  obviously  assumes  tha,t  the  user  desires  a  regular  mesh,  and  this 
may  not  always  be  true. 

Kamel  and  Eisenstein  [Kl]  present  a  mesh  generation  scheme  that  is  also 
based  on  a  regular  mesh.  The  user  supplies  the  boundary  oR  as  a  sequence 
of  arcs  subdivided  by  nodes.  First  the  authors  find  the  "best"  regular 
mesh  having  the  same  number  of  boundary  nodes  as  the  given  boundary  dR. 

Here  "best"  means  "closest  to  circular  shaped."  Their  program  begins  at 
a  node  of  a  regular  mesh  and  successively  annexes  rings  of  triangles  (the  last 
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ring  may  only  be  partially  annexed)  until  the  number  of  boundary  nodes 
in  the  mesh  equals  the  number  of  nodes  on  cffi  .  This  determines  the 
number  and  relative  positions  of  the  triangles  for  the  mesh.  The 
correct  number  of  nodes  are  then  placed  inside  R  and  the  mesh  is  then 
smoothed  by  applying  several  passes  on  the  interior  nodes,  using  the 
formula  • 

(2.1)  x  =  (  5Z  y)/|7l(x.)|  . 

y^(x.) 

The  authors  caution  that  their  procedure  does  not  work  well  if  the  input 
boundary  has  nodes  with  abrupt  changes  in  spacing,  or  if  the  domain  shape 
is  too  complex.  They  imply  that  interaction  with  the  algorithm  using  a 
graphics  terminal  is  an  advisable,  if  not  necessary,  part  of  using  their 


method. 


3. 


A  Serei -Automatic  Mesh  Generation  Scheme 


Ideally,  a  mesh  generation  procedure  should  have  the  capability  of 
grading  the  net  ( i .  e . ,  making  the  net  finer  in  selected  areas  of  the  domain) 
on  the  basis  of  information  supplied  by  the  user.  This  immediately  raises 
the  question  of  how  a  desired  grading  can  be  easily  transmitted  to  the 
program.  Also,  sometimes  the  "material"  in  the  domain  varies  abruptly 
from  one  region  to  another,  and  it  may  be  desirable  that  triangle  interfaces 
coincide  with  material  interfaces  to  allow  discontinuities  in  derivatives. 
This  requirement  would  obviously  complicate  a  completely  automatic 
triangulation  procedure  by  imposing  constraints  on  some  of  the  node 
positions. 

With  these  considerations  in  mind  we  have  arrived  at  the  following 
compromise.  The  user  must  supply  a  very  gross  triangulation  of  the  domain, 
reflecting  the  desired  grading  of  the  net,  and  with  triangle  boundaries  lying 
in  any  desired  position.  This  removes  both  of  the  problems  raised  above. 

The  large  triangles  can  then  be  subdivided  by  the  computer  in  the  obvious 
manner.  If  in  addition  the  program  has  the  capability  of  subdividing 
triangles  having  one  curved  side,  the  amount  of  input  for  most  domains  can 
be  kept  small. 

The  algorithm  used  to  subdivide  each  input  triangle  is  very  simple. 
i*'or  some  integer  k  ,  depending  on  how  fine  a  final  mesh  is  required,  each 
triangle  side  is  evenly  divided  into  k  segments  by  k-1  nodes. 

2 

Nodes  of  consecutive  sides  are  joined  by  parallel  lines  yielding  k 
triangles,  each  congruent  to  the  original  large  one.  This  has  the 
advantage  that  no  sharp  angles  are  generated;  the  smallest  angle  in  the 
original  triangulation  is  the  same  as  the  smallest  in  the  final  triangulation. 
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For  "curvilinear"  triangles  (having  one  curved  side)  the  algorithm  is 
similar.  Suppose  we  have  the  following  triangle  (Fig.  3-1-&)  which  we 
must  refine  ty  a  factor  of  eight  (Fig.  3»l-b) . 


Fig.  3*l-a  Fig.  3.1-b 

Seven  node  points  are  generated  on  each  of  the  straight  lines  AB  ,  EC  , 
and  AC  as  described  above.  The  seven  node  points  on  the  curve  BC  are 
then  obtained  by  finding  (approximately)  the  points  of  intersection  of  the 
curve  with  lines  perpendicular  to  the  straight  line  BC  and  passing  through 
the  node  point's  on  it.  The  node  points  on  AB  and  AC  are  each  joined  to 
the  node  points  on  the  curve  as  in  Fig.  3-1-b  by  straight  lines,  and  their 
points  of  intersection  are  then  used  to  form  the  triangles. 
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This  approach  to  mesh  generation  could  vary  conveniently  be  adapted 
to  use  with  an  interactive  display  system.  The  fact  that  the  user  has 
control  of  the  mesh  while  not  being  obliged  to  provide  large  amounts  of 
input  is  particularly  attractive  in  this  regard. 


4 .  An  Automatic  Two-Dimens tonal  Domain  Triangulator 

In  this  section  we  present  a  scheme  for  automatically  triangulating 
a  two-dimensional  domain.  Unlike  the  fully  automatic  schemes  discussed 
in  Section  2.2,  this  method  does  not  utilize  a  regular  mesh;  in  fact,  it 
specifically  is  designed  to  allow  for  the  construction  of  graded  nets.  It 
can  he  used  for  general  simply  connected  domains,  as  the  examples  appearing 
later  will  demonstrate. 

The  basic  strategy  of  the  method  is  as  follows.  The  user  is  required 
to  supply  the  initial  boundary  as  a  sequence  of  arcs,  along  with  a  simple 
rule  indicating  how  each  arc  is  to  be  subdivided.  The  sequence  of  arcs 
must  form  a  closed  loop,  so  for  now  we  assume  R  has  no  holes.  We  then 
have  an  "initial  boundary"  consisting  of  a  sequence  of  nodes  connected  by 
straight  lines.  We  then  proceed  to  annihilate  R  by  successively 
removing  triangles  from  R  ,  as  depicted  in  Figure  4.1.  As  each  triangle 
is  removed,  we  obtain  a  new  "current  boundary".  This  boundary,  along 
with  some  associated  information  can  be  conveniently  stored  as  a  two-way 
linked  list.  Our  goal  is  to  cover  (or  annihilate)  R  with  as  few 
triangles  as  possible  consistent  with  the  requirements  that  the  mesh  vary 
smoothly  and  have  no  sharp  angles  or  long  sides.  For  example,  for  a  unit 
square  domain  with  each  side  divided  into  segments  of  length  0.01  ,  we 
would  like  the  generated  mesh  to  be  composed  largely  of  triangles  which 
are  close  to  equilateral  triangles  having  sides  of  length  0.01  . 

We  will  employ  two  methods  of  forming  triangles.  The  first,  which 
we  will  refer  to  as  "trimming",  is  depicted  by  (i),  (iii),  (iv)  and  (vi) 
in  Figure  4.1.  The  second  method  of  generating  triangles  requires  the 
generation  of  a  node  in  R  ,  as  shown  by  (ii)  and  (v)  in  Figure  4.1.  We 
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will  refer  to  this  method  as  "notching" . 

First  we  discuss  the  generation  of  nodes.  Consider  the  diagram 

below 


Figure  4.2 


Let  l  be  the  average  distance  between  consecutive  nodes  in  the 
initial  boundary  dR  .  Then  P  is  the  point  on  the  bisector  of  angle 
which  approximately  minimizes 

0.1)  ?(P)  =  |c-b2j  +  |c-b-|  +  r  £  |  a1-'b1 1  +  K-bJ}  +  —  tla^Ij  +  ja2-J| 

where 

4 

7  -  £  |b,-b|  /  > 

i=l 

and 

b  =  £b  /4  . 
i=l 

The  first  two  terms  are  designed  to  make  the  (potential)  triangles 
close  to  equilateral.  The  third  term  has  a  smoothing  influence  on  the 


lengths  of  the  arcs  of  the  current  boundary,  and  the  last  term  attempts  to 
make  the  lengths  of  the  arcs  of  the  current  boundary  converge  to  l  .  If 
either  or  both  of  the  neighboring  vertices  nave  angles  less  than  5rr/ 6  , 
the  same  procedure  is  performed  at  these  vertices,  yielding  two  or  three 
nodes.  Their  centroid  is  chosen  as  the  trial  node. 

Now  that  we  have  a  method  for  generating  interior  nodes,  we  can  now 
describe  the  algorithm.  In  words  it  is  as  follows: 

Step  1* 

For  each  vertex  on  the  current  boundary  having  interior  angle  a 
less  than  or  equal  to  tt/3  ,  form  a  triangle  by  trimming  and  remove  it 
from  E  ,  as  depicted  below. 


Figure  b.3 


Step  2. 

Find  any  consecutive  vertices  both  having  interior  angles  Qi^  and 
a2  less  than  5tt/6  .  if  none  are  found,  proceed  to  step  3.  Otherwise, 
choose  the  pair  with  the  minimum  value  of  -  2rr/3l  *  la2  "  2rr/^  I  > 

and  generate  an  interior  node  P  as  described  above.  We  then  have  a 
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situation  such  as  one  of  these  below: 


Figure  4.4 


It  is  obvious  what  we  should  do  in  Figure  (4.4-a),  but  in  Figure (4 .4-b), 
it  is  debatable  whether  we  should  trim  triangle  ,  or  notch 

triangle  PPTP.  (followed  presumably  by  two  trims  of  triangles  PP_P^, 

j  d  ^ 

and  PP^  .)  Let  6..^  =  jPg-P^j  and  d£  =  jP^-P^I  ,  where  |  Pi-P^  I 
is  the  distance  between  points  P.  and  P.  .  Let 

=  ft^d^b^b  )  ,  S2  =  [b^bg^d^b^}  and  S,  =  [b^a^ag^b,.}  .  Let 
V1  ’  V2  an<*  v5  536  average  value  of  the  members  in  ,  Sg  and 
respectively.  Wow  define  w^  ,  Wg  and  w^  by 
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^  » 


wi =  (I.Vvii +  I di_vi i +  IVvii +  !Vvii)/4vi  * 

w2  =  0VV2i  +  IV^I  +  I d2_v2 1  +  lb5"v2l)  /W2  ' 

w3  =  ( lbi~v3 1  +  1  ai“v3 1  +  la2"v3 1  +  lb5"v3l)/W3  • 

Let  =  lainfr-^Wg}  .  Then  if  k  =  1{2]  ,  \  >  w5  ^  and.  angle 

PlP3Plt-  ^P3PUP5^  is  less  than  or  ec3.ual  to  tt/2  ,  then  trim  triangle 
P2P3Pi4  lp3php5}  •  Otherwise,  notch  triangle  PP  P^  .  Then  go  to 
step  1. 

Step  3- 

Pind  any  vertex  having  interior  angle  a  <  tt/2  .  If  there  are  none, 
go  to  step  It.  Otherwise  compute  an  interior  node  corresponding  to  the 
vertex  as  indicated  below. 


P 


Figure  k.5 
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Let 


d5  =  lp2"?l|l  and  dk  =  lp“p3i  *  define 


4>y 


v4 


,  w^  and  w 


VU  =  (bl  +  d3  +  V  /  5  > 

v5  =  (bl  +  ai  +  a2  +  V  /  h  ’ 

wk  =  (iVv4l  +  Id3-vl4-i  +  IVvfc|)/>4  > 

w5  =  ( lbi~v5 1  +  I  ai~  v5 1  +  iVb5l  +  * 

If  w)(  <  w5  then  trim  triangle  P^P^P^  and  go  to  step  1.  Otherwise 
repeat  step  3  until  a  successful  trim  is  achieved  or  until  all  the  vertices 
have  been  tested.  If  no  trim  can  be  made,  proceed  to  step  4. 


Step  4. 

Let  and  p^  be  't’rie  smallest  and  largest  distances  between 

consecutive  nodes  in  the  current  boundary.  We  then  do  the  following 

P1 

(4a)  Set  7  =  • 


(4b)  Of  those  nodes  on  the  current  boundary  having  at  least  one  of 

its  incident  boundary  arcs  less  than  or  equal  to  y  ,  choose  the  node 
having  tue  smallest  angle  <X 


(4c)  If  cc  <  ti  ,  generate  a  node,  as  in  Figure  4,2,  notch  a  triangle 

(either  PJ^P^  or  PP,Pj,)  and  go  to  step  1. 

Otherwise,  go  to  (4d). 


(4d)  If  7  =  p0  ,  stop  (we  have  failed), 
and  go  to  (4b) . 


Otherwise,  set 
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Remarks ; 


(1)  The  parameter  7  in  step  4  was  found  to  be  necessary  to  force  the 
program  to  consider  first  those  areas  of  the  domain  to  be  covered  by  a 
relatively  fine  mesh.  The  averaging  effect  built  into  the  node  generator 
combined  with  this  restriction  on  the  lengths  of  the  arcs  considered  first 
tends  to  fill  in  the  domain  near  the  short  boundary  arcs  first;  the  size  of 
the  triangles  increases  with  distance  from  the  boundary. 

(2)  Steps  1,  2  and  3  are  designed  to  remove  any  "protrusions"  from  the 
current  domain.  Their  overall  effect  is  to  make  the  current  boundary 
convex  or  near  convex. 

(3)  An  interesting  and  potentially  better  method  for  generating  nodes 
might  be  to  allow  P  (Figure  4.2)  to  lie  anywhere  in  tne  current  domain, 
rather  than  restrict  it  to  lie  on  the  bisector  of  the  angle  PpP^P^  . 
Minimizing  !?(P)  would  be  considerably  more  complicated,  but  might  be 
justified  if  triangulations  with  many  fewer  triangles  resulted. 

(4)  In  all  cases  where  a  node  is  generated,  we  check  to  see  if  it  lies 
in  the  current  domain  by  using  an  algorithm  described  in  [ N2  ] ^  and  before 
forming  any  triangle  we  check  to  make  sure  no  nodes  lie  inside  the  triangle. 
Thus,  our  algorithm  is  "fail  safe";  if  it  terminates  successfully,  it 

has  generated  a  legal  triangulation. 

(5)  we  mentioned  above,  the  current  boundary  can  best  be  stored  as  a 
linked  list,  so  that  deletions  and  insertions  can  be  carried  out  with 
little  data  rearrangement.  To  reduce  computation,  the  lengths  of  each 
boundary  arc  and  the  sine  and  cosine  of  each  interior  angle  were  also 
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retained  in  conjunction  with  the  linked  list.  These  quantities  were 
computed  once  by  the  routines  "trim”  and  "notch"  which  actually  modify 
the  current  boundary,  and  were  then  available  as  needed  by  steps  1-4. 

Other  quantities  might  also  have  been  retained. 

(6)  Figure  5-10  demonstrates  the  use  of  the  algorithm  when  the  domain 
has  a  hole  in  it.  We  simply  provide  a  "boundary  arc"  cutting  through  the 
domain,  joining  the  outer  boundary  to  the  inner  one.  The  fact  that  the 
closed  loop  forming  the  boundary  overlaps  itself  and  in  some  parts  does 
not  really  correspond  to  a  boundary  at  all  does  not  effect  the  algorithm. 
The  smoothing  program  (discussed  below)  does  not  move  node  points  lying  on 
these  pseudo  boundary  arcs;  hence,  this  device  can  be  used  to  force  some 
inter-element  boundaries  to  lie  in  specified  positions.  In  Section  2.5 
wc  explained  why  this  might  sometimes  be  desirable. 

Below  are  several  examples  of  domain  triangulations.  The  output 
of  the  algorithm  described  above  has  been  smoothed  by  carrying  out  three 
or  four  sweeps  of  the  interior  (non-boundary)  nodes  using  formula  (2.1). 

The  nodes  on  the  curved  portions  of  the  boundary  were  obtained  in  the 
same  manner  as  described  in  Section  2.5* 

As  we  implied  in  remark  4  above,  more  sophisticated  methods  of  node 
generation  and  trim/notch  strategies  might  yield  "better"  triangulations, 
and  such  investigations  are  potentially  fruitful  topics  of  further  research. 
It  is  even  difficult  to  define  precisely  what  we  mean  by  a  good  graded  mesh. 
It  depends  on  the  relative  importance  of  (a)  sharp  angles  (b)  the  total 
number  of  triangles  (c)  the  smoothness  of  variation  of  the  mesh,  and 
perhaps  other  factors.  It  would  be  nice  also  to  be  able  to  a  priori 
guarantee  certain  desirable  characteristics  of  the  generated  mesh  in  terms 
of  characteristics  of  the  initial  boundary. 
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Figure  4.7 


Figure  4.8 


Figure  4.9 
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A  Storage  Scheme  for  Finite  Element  Meshes  and  Associated  Boundary  Data 


As  we  have  tried  to  emphasize  in  the  preceding  sections,  the  finite 
element  mesh  will  probably  not  be  regular;  thus  the  storage  scheme  for 
it  must  be  general.  In  this  section  we  present  a  scheme  for  storing 
general  finite  element  meshes,  and  show  that  for  most  elements,  the 
required  storage  is  small  compared  to  the  storage  required  to  store  the 
non-zero  elements  of  the  coefficient  matrix. 

We  shall  see  in  Chapter  3  that  the  procedure  for  generating  the  finite 
element  equations  is  carried  out  element -by- element;  therefore,  it  is 
beneficial  to  be  able  to  retrieve  the  node  coordinates  for  each  triangle 
easily.  On  the  other  hand,  we  do  not  want  to  store  copies  of  the  node 
coordinates  for  each  triangle, because  many  or  all  of  the  nodes  belong  to 
more  than  one  triangle.  Another  point  is  that  we  really  only  need  to 
remember  the  vertices  of  the  triangles  in  the  mesh;  node  coordinates  on 
the  sides  and  in  the  interior  of  the  triangle  can  be  generated  as  needed, 
provided  we  have  a  formula  for  generating  them. 

For  definiteness,  suppose  our  mesh  has  V  vertices,  S  triangle 
sides,  triangles,  and  H  holes  in  it.  The  number  of  interior  sides 

{vertices}  and  boundary  sides  {vertices}  will  be  denoted  by  ST{VT}  and  S-Jv.,} 

11  B  B 

respectively.  In  [El]  the  following  relations  between  these  mesh  parameters 
are  proved. 

(5-1)  Na  =  |  (SB  +  2SX)  =  VB  +  2VX  -  2(H-1)  . 

For  a  typical  mesh  having  S  »  S  ,  VT  »  V_  ,  and  small  H  ,  the 

1  D  1  D 

the  relations  (5*1)  yield 
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(5-2) 


and 


To  aid  in  describing  the  scheme  we  are  about  to  present,  consider 
the  figure  below,  where  the  domain  has  been  covered  by  "3-10"  elements 
(see  Appendix  A  for  details).  The  nodes  are  numbered  sequentially, 
beginning  with  the  vertex  nodes,  followed  by  the  arc-midpoint  nodes  (see 
below),  followed  by  the  nodes  on  the  sides  and  interiors  of  the  triangles. 

A  node  with  tag  k  is  understood  to  have  coordinates  (x,  ,y.  )  .  The 
circled  numbers  are  boundary  reference  numbers  which  are  associated  with 
the  corresponding  triangle  sides.  Later,  boundary  conditions  can  be 
assigned  with  respect  to  these  numbers.  The  arc -midpoint  nodes  tagged 
6  and  7  are  generated  and  allowed  for  in  the  storage  scheme  so  that  some 
form  of  interpolation  along  the  boundary  can  be  subsequently  done.  See 
Ziamal  [Z 6]  for  one  such  possibility,  where  quadratic  interpolation  is  used. 


The  storage  scheme  is  depicted  schematically  below.  Note  that  the  pointers 
for  each  triangle  are  listed  in  a  counterclockwise  manner,  in  order  of 

bj 


vertices,  sides,  and  interior.  Suppose  the  vertices  are  numbered 
(xl'^l)  >  (X2’y2^  (x.,y^)  •  We  adopt  the  convention  that  the  i-th 

side  of  the  triangle  is  the  one  with  endpoints  (x.,y.)  ,  (x  ,y  )  , 
where  k  =  (i+l)  mod  3  • 

/  pointer  will  ordinarily  require  fewer  bits  than  a  node  coordinate 
or  a  coefficient  of  an  equation.  For  example,  on  an  IBM  360  computer, 
pointers  may  conveniently  be  stored  in  two  bytes  (a  half-word)  whereas  a 
coordinate  would  require  four  or  eight  bytes .  In  general,  we  will  denote 
this  ratio  by  a  (a  <  1)  .  Ignoring  the  storage  required  for  the 
boundary  table  (since  we  assume  S  «  ST)  ,  then  the  amount  of  storage 
required  for  the  mesh  is  approximately 

(5.4)  vM  =  amNA+2V  4  (om+i)NA  , 

where  m  is  the  number  of  nodes  associated  with  each  element. 

Let  n  ,  n„  and  n  be  the  number  of  parameters  associated, 

V  b  X 

respectively,  with  vertex  nodes,  the  node(s)  on  each  triangle  side  (not 
including  the  endpoints),  and  the  interior  of  each  triangle.  For  example, 
element  3-10  would  yield  n  =  1  ,  nQ  =  2  ,  and  n  =  1  .  We  now  want  to 
show  that  VM  is  usually  small  compared  to  the  number  of  non-zero 

elements  in  the  coefficient  matrix  A  .  In  Section  4.6  we  show  that 

(5-5)  4  4  ^(V-3)  +  a2(S+3-2V) 

a-,  + 

=  ( — 2 — “)  \  *  (using  (5.2)  and  (5.3)) 

where  j.,  and  depend  on  n„  ,  n0  ,  and  n.r  . 

X  d  V  b  X 
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The  following  table  serves  to  make  our  point. 


+ 

Element 

CT1 

I 

HH 

Cto+l 

a  =¥ 

H|CJ 

II 

0 

a  =  1 

1-3 

5 

2 

3.5 

7/4 

2 

5/2 

4 

2-6 

27 

19 

23 

5/2 

3 

4 

7 

3-4 

64 

37 

50.5 

2 

7/3 

3 

5 

3-10 

84 

69 

76.5 

14/4 

13/3 

6 

11 

1 4-6 

161 

106 

133.5 

5/2 

3 

4 

7 

4-15 

176 

188 

19/4 

6 

9/2 

16 

5-6 

272 

139 

205.5 

5/2 

3 

4 

7 

5-21 

405 

370 

387.5 

25/4 

8 

23/2 

22 

+See  Appendix  A  for  a  description  of  the  elements. 

Table  5*1 

Thus  for  all  but  piecewise  linear  polynomials,  V  «  ,  even 

when  a  =  1  .  If  a  direct  method  is  being  used  to  solve  the  generated 
system,  the  storage  required  for  the  decomposition  will  be  much  more  than 
,  so  that  becomes  rather  insignificant  in  comparison  to  overall 
storage  requirements.  Our  conclusion  is  that  the  use  of  an  irregular 
rather  than  regular  mesh  for  finite  elem<-  methods  does  not  in  general 
cause  an  important  increase  in  storage  requirements. 

In  future  chapters  we  will  often  need  the  dimension  of  A  ,  the 
number  of  parameters  (unknowns)  in  our  problem.  Using  (5*2),  (5*3),  and 


( 

) 


L 
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the  definition  of  V  ,  S  ,  n^  ,  ng  and  n^  above*  we  have 
(5.6)  N  =  nvV  +  ngS  +  n^ 

=  (“i  +  1  “v  +  2  ns^  N& 


CHATTER  3 


GENERATION  OF  FINITE  ELEMENT  EQUATIONS 

1.  Introduction 

In  this  chapter  we  discuss  in  detail  the  computation  involved  in  the 
generation  of  the  finite  element  equations.  The  general  procedure  with 
minor  variations  appears  rather  frequently  in  engineering  articles 
(usually  with  regard  to  a  specific  problem  and  element);  Zlamal  [Z5]  has 
also  described  the  procedure,  again  for  a  specific  situation.  Felippa 
and  Clough  [  FI]  give  an  excellent  summary  of  the  generation  process 
although  they  give  few  details.  Unfortunately,  we  feel  that  too  little 
emphasis  is  devoted  to  carefully  identifying  which  of  its  several  sub-tasks 
are  independent  of  others,  and  which  ones  are  dependent  only  on  particular 
components  of  the  problem  being  solved.  For  example,  is  a  specific 
computation  dependent  only  on  the  characteristics  of  the  piecewise  polynomial, 
and  independent  of  the  differential  operator  and  the  boundary  conditiors? 

IIow  much  of  the  computation  can  be  salvaged  if  only  part  of  the  problem  is 
changed  and  hew  can  that  amount  be  maximized  for  a  given  change?  Answers 
to  questions  such  as  these  are  important  in  the  design  and  implementation 
of  efficient  programs.  In  this  chapter  we  identify  these  various  sub-tasks 
and  indicate  which  parts  of  the  generation  procedure  can  be  isolated  as 
separate  modules.  The  equation  generation  phase  is  itself  inherently 
modular,  even  though  in  its  entirety  it  is  usually  regarded  as  the  second 
of  three  stages  in  the  application  of  the  finite  element  method.  The 
first  phase  is  the  mesh  generation,  and  the  third  is  the  solution  of  the 
generated  algebraic  system. 
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As  we  stated  in  Chapter  1,  the  finite  element  method  is  a  Ritz- 
Galerkin  method  where  the  trial  functions  have  small  support.  That  is, 
the  approximate  solution  is  represented  in  terms  of  a  local  basis. 

Generation  of  such  a  basis  for  rectangular  domains  is  fairly  straight¬ 
forward,  as  we  described  in  Section  l.k.  However,  for  domains  of 
arbitrary  shape,  where  it  is  not  convenient  or  possible  to  restrict  the 
support  of  the  basis  functions  to  rectangles,  a  different  approach  is 
necessary,  and  is  provided  by  the  use  of  so-called  interpolation  polynomials 
[FI,  Zl].  The  construction  of  such  polynomials  and  their  relationship  to 

the  local  basis  is  the  subject  of  Section  2. 

■** 

Once  we  have  the  basis  for  our  approximate  solution  v(x,y)  ,  the 
next  step  is  to  carry  out  the  integrations  required  to  obtain  the  coefficients 
of  the  linear  system,  as  described  in  Chapter  1,  Section  2.  We  emphasize 
that  the  computational  procedure  is  considerably  different  from  the  formal 
description  appearing  in  Chapter  1.  The  integrations  required  to  determine 
the  coefficients  are  carried  out  element -by-element,  and  the  actual  basis 
functions  are  not  (explicitly)  generated  at  all.  This  computation,  where 
the  equations  are  actually  generated,  is  the  subject  of  Section  3* 

The  last  part  of  the  generation  procedure  is  usually  referred  to  as 
assembly  of  the  equations,  or  just  "assembly",  and  is  the  subject  of 
Section  b.  Suppose  our  (linear)  elliptic  boundary  value  problem  is  cast  in 
a  variational  form,  with  a  functional  I  [  v]  that  we  wish  to  minimize  with 
respect  to  the  parameters  of  v  .  The  result  of  the  element -by- element 
process  described  in  Section  3  is  a  set  of  small  quadratic  functions,  each 
one  representing  a  contribution  to  I  [v]  of  a  particular  subdomain  (element) 
of  the  domain  R  .  These  small  functions  have  some  parameters  in  common. 
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and  the  process  of  combining  these  functions  into  a  single  large  one  is 


the  task  referred  to  as  ''assembly".  The  elimination 
values  are  determined  by  boundaiy  conditions  is  also 


of  parameters  whose 
done  at  this  stage. 


5}' 


2 .  Construction  of  Interpolating  Polynomials 

In  this  section  we  describe  the  construction  of  interpolating  polynomials 
on  triangles.  However,  the  procedure  and  many  of  our  remarks  apply  for  a 
general  polygon.  Let  R  be  a  simply  or  multiply  connected  domain  in 
the  (x,y)  plane  with  piecewise  linear  boundary  Sr  .  Zlimal  [z6]  has 
described  a  method  for  removing  this  restriction  on  Sr  .  We  assume  R 
has  been  triangulated  into  N  triangles,  with  adjacent  triangles  having 
either  a  common  vertex  or  a  common  side  and  with  the  union  of  the  closed 
triangles  equal  to  R  U  SR  .  An  example  of  a  domain  triangulated  in  this 
way  appears  below. 


Figure  2.1 

Our  aim  is  to  construct  a  piecewise  polynomial  of  degree  d  on 
R  U  SR  •  To  do  this  we  assume  that  on  each  triangle  Tv  c  R  U  Sr  , 
y(x,y)  is  a  polynomial  pv(x,y)  of  degree  d  .  We  impose  the  conditions 
that  pv  and  p'  on  neighboring  triangles  have  common  values  and/or 
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derivatives  at  node  points  lying  on  their  common  boundary.  We  begin  by 
studying  the  choice  of  parameters  necessary  to  have  v(x,y)  of  class 
C(ct)  .  This  problem  has  also  been  considered  in  [H2]  for  general 
polygons,  and  we  give  a  special  case  of  their  arguments  below. 

Consider  the  figure  below,  depicting  two  adjacent  triangles  Tv  ana 
7 

T  having  common  boundary  L  .  Directions  tangent  and  normal  to  L  will 
be  denoted  respectively  by  s  and  n  .  Thus  ^  (Q  )  is  the  derivative 
of  v  normal  to  L  evaluated  at  Q  .  The  notation  v(s)  will  mean  the 
function  v  evaluated  at  the  point  Q,^+  s(Q^-Q  )  . 


Figure  2.2 


which  implies  v(s)  will  be  continuous  along  L  if 


(2-3) 


d+1  =  20+1)  . 


1  v 


If  (.2*3)  is  satisfied,  — is  a  polynomial  of  degree 


on 

d-k  in  s  ,  having  d-k+1  coefficients.  Thus  we  require  d-k+1 

vk  \>  sk  7 

o  o  ,  o  v 


conditions  of  agreement  between 


dn 


•sk 

and  —  along  L  if  — 7 
Snk  Snk 


is  to  be  continuous  along  L  .  The  conditions  (2.2)  imply 


(2-M 


^PV(Q,)  *V«L) 


ba? 


i  =  1,2  ,  j  <  p-k  , 


imposing  2(p-k+l)  conditions.  Therefore,  we  need  d-k+1  -  2(0-k+l)  = 

2(p+l)-k-2(p-k+l)  =•  k  more  conditions  of  agreement  imposed  on 
^k^v  ^k  7 

— *r-  and  — — _  .  Carrying  out  the  same  arguments  for  k  =  1,2,  ...,a 
cm  cm 

and  summing  implies  we  need  Cr(o+l)/2  additional  "normal  derivative" 
parameters  situated  at  nodes  along  L  if  v  is  to  be  of  class 
along  L  .  Using  the  fact  that  the  number  of  coefficients  mi  a  general 
d-th  degree  polynomial  in  two  variables  is  n.  =  (d+1) •  d+2)/2  yields 
the  inequality 


(2-5) 


(d+lKd+2)  >  ^(p+lHfi+2)  ,  q(g+l)  ^ 


where  the  term  -(P-— is  -*-,he  number  of  derivatives  DTv  ,  jr  |  <  p  . 
The  factor  3  appears  because  a  triangle  has  3  sides  and  3  vertices.  The 
inequality  (2.5)  yields  the  conditions 


(2.o) 


@(p+l)  >  3a(o+l't  and  d  =  2p+l 
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Surplus  degrees  of  freedom  in  the  polynomial  can  be  associated  with  nodes 
in  the  interior  of  the  triangle.  For  approximation  properties  of  these 
piecewise  polynomials  see  [Z1,Z4,Z 5]. 

The  conditions  (2.6)  imply,  in  particular,  that  we  require  d  to  be 
at  least  5  [93  for  v  to  be  in  C^'  [C  ^ }  .  Note  that  this  applies 
only  to  the  polynomials  described  above.  A  common  technique  used  to  reduce 
the  number  of  parameters  in  the  problem  is  to  restrict  the  polynomial  of 
degree  d  on  each  triangle  to  be  of  degree  d-k  ,  k  >  0  in  parts  of  the 
triangle.  For  example,  Goel  [G2 ]  begins  with  the  3-4  element  (Appendix  A) 
and  by  a  suitable  modification  forces  the  normal  derivative  to  each  side 
of  the  triangle  to  vary  linearly  along  the  boundary.  Agreement  in  value 
and  first  derivatives  at  the  vertices  Q,^  and  guarantees  continuity 

in  the  first  derivatives  along  L  .  Zienkiewiez  [Z3 ]  and  Clough  and  Tocher 
[Cl  ]  also  present  techniques  for  achieving  the  same  goal.  Irons  [11  ] 
describes  a  method  for  constructing  a  quartic  element  generating  a 
piecewise  polynomial  subspace  veC^'  .  Bell  [b6  1  describes  a  method 
for  eliminating  the  side  parameters  on  the  5-6  element  by  imposing  the 
condition  that  the  derivative  of  the  polynomial  normal  to  each  triangle 
side  be  a  cubic  rather  than  a  quartic.  Zl&mal  [z6  ]  uses  a  similar  technique 
to  eliminate  the  centroid  parameter  from  element  3-4. 

We  will  refer  to  elements  of  the  type  just  described  as  deficient 
elements,  to  distinguish  them  from  eleme.  its  which  are  polynomials  of  a 
particular  degree  over  the  whole  triangle.  We  have  restricted  our 
studies  in  this  thesis  to  non-deficient  elements.  (An  explanation 
appears  at  the  end  of  this  section.) 
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We  now  turn  to  the  actual  construction  of  interpolating  polynomials. 

Let  the  number  of  nodes  associated  with  each  triangle  be  m  =  5(nig+l)+imj.  , 
where  tn0  >  0  is  the  number  of  nodes  on  each  triangle  side  (not  including 

D  — 

the  endpoints),  and  let  m^.  >  0  be  the  number  of  nodes  in  the  interior 
of  each  triangle.  We  denote  the  total  number  of  nodes  in  the  domain  by 
M  ,  and  the  coordinates  of  the  nodes  by  =  (x^y^)  ,  i  -  1,2,  ...,M  . 

The  indices  of  the  nodes  of  triangle  Tv  will  be  denoted  by  \n,  \>2>  •  •  •  >  v  , 
with  the  vertex  nodes  coming  first  in  counterclockwise  order,  followed  by 
the  side  nodes  also  in  counterclockwise  order,  followed  by  the  interior 
nodes  (in  no  specific  order).  When  m^  >  0  we  assume  that  the  side  nodes 
evenly  sub-divide  the  triangle  sides.  Triangle  Tv  is  depicted  in 
Figure  2.5  below. 


Triangle  \>2 
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To  simplify  the  notation  in  the  sequel,  we  will  assume  that  \k  =  i  . 
We  begin  by  expressing  pv(x,y)  in  the  form 

(2.7)  ?v(x,y)  =  [  aj  cp[d^  =  «v  cp^  =  qf  d^  av  , 

k=l 

/  n  \ 

where  qA  '  is  the  n^-dimensional  column  vector  whose  elements  are  the 
monomial  terms  of  the  general  d-th  degree  polynomial  in  two  variables. 

We  assume  that  the  terms  appear  in  order  of  non-decreasing  degree,  and  in 
increasing  powers  of  y  for  consecutive  terms  of  the  same  degree.  For 
example, 

(2.8)  qP)  =  (l,x,y,x2,xy,y2,x3,x2y,xy2,y5)  . 


The  superscript  d  will  not  usually  be  included  explicitly.  The  vector 
av  contains  the  coefficients  of  pv  ,  and  and  q>^  refer  to  the 
k-th  components  of  av  and  cp^d'  respectively. 

Now  our  goal  is  to  represent  pv  on  Tv  in  terms  of  its  nodal 
parameters.  For  example,  if  d  =  1  (n^  =  3)  ,  pV  can  be  uniquely 
characterized  by  its  values  at  the  vertices  of  Tv  .  If  pJ  is  a  cubic 

polynomial  (n^  =  10)  ,  one  way  to  characterize  it  is  by  the  parameters 

DTpv(Q.)  >  i  =  1; 2, 3j  Jt|  <  1,  and  pv(Q^)  ,  where  is  at  the 
centroid  of  Tv  .  Note  that  both  of  these  characterizations  assure 
continuity  across  interelement  boundaries,  as  predicted  by  the  theory 
presented  in  the  first  part  of  this  section. 

Vie  denote  the  number  of  parameters  associated  with  node  i  by  , 
and  the  vector  of  those  parameters  by  q^  .  Its  j-th  element  will  be 

denoted  by  q.  .  .  The  parameters  associated  with  pv(x,y)  ,  ordered  as 


6o 


indicated  by  Figure  2.3,  are  then  given  by 


(2-9) 


T 

v  f  T 

i  =  (qr 


T 

q^  * 


a  ) 

in7 


New  suppose  is  a  column  vector  of  length  ^  whose  elements  are 
linear  functionals  designed  to  produce  the  parameters  associated  with 
node  i  when  it  is  applied  to  v  .  For  example,  the  vertex  nodes  for 
the  cubic  polynomial  discussed  above  would  have  associated  vector 
functionals  of  the  form 


(2.10) 


Such  an  operator  applied  to  a  j -dimensional  vector  is  understood  to 
operate  term  by  term;  a  column  vector  would  yield  a  -dimensional 

column  vector,  a  row  vector  would  produce  a  by  j  matrix.  Defining 
£V  by 


(2.11)  £v  =  (J?»  Q’  £3  >•••>  $  > 

we  have  immediately  the  identity 


(2.12)  £vfv)=qv 


Using  (2*7)  along  with  the  fact  that  v  is  pJ  on  Tv  ,  we  can 
rewrite  (2.12)  as  a  matrix  equation  involving  orv  and  qv  : 

(2.13)  qV  =  £V[pV] 

-  £V[<pV] 

-  xW 

=  cvav 
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As  we  stated  in  Section  3*2,  we  are  restricting  our  basis  to  be 

polynomials  of  a  specific  degree  on  each  element  ( non-deficient  elements) 

m 

so  we  assume  that  n^  =  ^  .  Cv  will  be  non-singular  provided  our 

node  points  are  distinct  and  our  parameters  associated  with  each  node 
point  are  linearly  independent. 

Using  (2.13)  in  (2.7)  yields 

T  T  -T 

(2.14)  pV(x,y)  =  aV  q>  =  q*  CV  cp  , 


giving  the  polynomial  on  Tv  in  terms  of  the  parameters  which  we  have 

-T  -1  T 

chosen  to  characterize  it.  Here  the  notation  Cv  means  (Cv  ) 

Consider  again  the  cubic  example  discussed  above  on  triangle  Tv 
having  vertices  =  (x^y^  ,  i  =  1,2,3>  and  centroid  =  (x^,y^)  . 
Thus  iu  =  3  >  i  =  1,2,3  }  and  =  1  .  Then  qv  is 
T 


(2.15) 


=  (vx,v1; 


x'vl,y'Vv2,x'v2,y>v3'y 


3,x'v3,y'V 


where  v.  ,  denotes  the  first  partial  derivative  ol  v  with  respect  to 
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We  can  write  (2.14)  in  the  form 


(2.16) 


pv(^y) 


nd 

■  I  < 

k=l 


•*d 


k=l 


v 

k 


“i 


y 


where  is  associated  with  the  j-th  parameter  of  node  k  .  The  t’s 

k,  3 

are  often  referred  to  as  coordinate  functions  in  the  engineering  literature, 

and  are  the  members  of  the  local  basis  in  terms  of  which  v(x,y)  is  being 

represented.  It  is  easy  to  verify  that  .  vanishes  on  the  boundary 

k,  3 

of  the  union  of  the  triangles  to  which  node  k  belongs,  provided  the 

nodal  parameters  have  been  chosen  to  guarantee  continuity  across  interelement 

boundaries.  The  function  ijr  .  is  defined  to  be  zero  outside  the  region. 

k;  J  M 

There  will  obviously  be  N  =  ]T  parameters  and  corresponding 

k=l 

basis  functions  in  the  representation  of  v  on  R  . 

The  procedure  we  have  described  for  generating  the  basis  functions  is 
in  a  sense  quite  general.  The  generation  of  the  matrix  Cv  can  be 
isolated  in  a  subroutine,  and  the  only  required  input  is 

(i)  the  degree  d  of  the  polynomial, 

(ii)  the  node  coordinates  (or  a  formula  for  generating  them), 

(iii)  the  nodal  parameters. 

Item  (ii)  is  supplied  by  the  mesh,  while  (i)  and  (iii)  can  be  specified  by 
the  user.  Each  row  of  Cv  is  obtained  by  evaluating  the  components  of  cp 
at  the  corresponding  node  coordinate  (perhaps  after  differentiating  them, 

if  the  corresponding  parameter  is  a  derivative  parameter) .  Observe  that 

l  l 
1  2 

the  matrix  terms  are  simple  monomial  terms  of  the  form  x^  y^  ,  £^, >  0  . 
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Differentiation  of  them  can  be  easily  done  symbolically,  with  obvious 

computational  advantages.  Furthermore,  er tries  in  each  row  will  have 

l  l 

common  factors  of  the  form  x^  and  y^  ,  £  >  0  .  Thus,  the  generation 
of  Cv  can  be  implemented  in  an  efficient  as  well  as  general  way. 

Provided  we  choose  linearly  independent  parameters  equal  in  number 

to  (d+l)(d+2)/2,  the  matrix  Cv  will  be  non-singular  and  we  can  obtain  the  basis 

-T 

functions  on  TV  in  the  factored  form  =  [Cv  <p}^  .  Having  pV(x,y) 
in  the  form  (2.1^)  is  pe.rticularly  convenient  for  our  intended  applications. 
Anticipating  the  next  section,  observe  that  on  triangle  Tv  the  following 
equation  holds 

(2.17)  DV(x,y)  =  qVCV  DT<p  , 

where  the  differential  operator  DT  operates  term  by  term  on  <p  .  Thus, 
if  the  basis  functions  are  derived  in  this  factored  form,  derivatives  of 
tne  trial  function  v  can  be  easily  obtained  symbolically.  Furthermore, 
for  two  differential  operators  DT  and  DCT  ,  expressions  of  the  form 
DTpv(x,y)DapV(x,y)  become 

(2.13)  qV  CV  [DTq>{D°<p}T]CV  qV  , 

and  again  the  matrix  in  the  square  brackets  can  be  obtained  sym  olically. 

I  £ 

1  2 

Its  terms  ere  monomial  terras  of  the  form  x  y  ,  £^,  £ ^  >  0  . 

Note  that  the  ease  with  which  we  can  manipulate  the  basis  functions 
depends  upon  being  able  to  express  each  basis  (coordinate)  function  as  a 
linear  combination  of  monomials.  For  some  deficient  elements  this  is 
not  possible,  and  differentiation  and  integration  of  the  coordinate 
functions  must  be  done  numerically  [Hi]  and/or  carried  out  by  hand  and 


6k 


programmed  explicitly.  This  would  not  be  particularly  disadvantageous 
for  a  special  program  designed  to  solve  a  specific  class  of  problems. 

Also,  in  a  production  setting,  many  of  the  computations  involving  the 
functions  can  be  done  once  and  the  results  stored  in  a  library.  However, 
from  our  poinr  of  view  of  designing  a  general  purpose  program  we  have 
favored  the  use  of  non-deficient  elements,  which  guarantee  the  invertibility 
of  Cv  and  the  representation  of  the  basis  functions  as  linear  combinations 
of  monomial  terms. 
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3.  Generation  of  the  Equations 

This  section  describes  the  actual  calculation  of  the  finite  element 
equations  once  we  have  an  expression  for  our  piecewise  polynomial  as 
discussed  in  the  previous  section.  Suppose  our  problem  is  cast  in  a 
variational  form,  and  we  wish  to  minimize  a  functional  l[v]  =  I^[v]+I^[ v] 
with  respect  to  the  parameters  of  v  ,  where 


a 


(3.1) 

IR[v]  =  II  (alV  a2Vy  +  a3Vy+  V  +  V)dX  dy  ’ 

I 

Li 

(3.2) 

Wv]  =  I  (a6v2+a7v)ds  ' 

Sr 

;  i 

Li 

and  v 

is  restricted  to  satisfy  a  linear  bo’mdary  condition  of  the 

i] 

(3.3) 

aQv+a_v  +  a,..v„  =  a.n  on  Sr  . 

o  9  n  10  s  11 

II 

Here  a. 

i 

,  i  -  1,2, ...,11  are  functions  of  x  and  y  ,  and  v^  i 

are  the  (inward)  normal  derivative  and  (counter-clockwise)  tangential 
derivative  of  v  on  Sr  . 

Our  interest  here  is  in  the  implementation;  consequently,  we  will 
not  concern  ourselves  with  the  range  of  boundary  value  problems  that  can 
be  covered  by  the  above  form,  or  relations  and/or  smoothness  that  the 
functions  a.  ,  j  -  1,2,  ...,11  and  v  must  possess  in  order  for  the 
problem  to  be  correctly  formulated.  Also,  we  do  not  mean  to  imply  that 
the  procedure  to  be  described  applies  only  to  the  above  functional.  It 
will  be  clear  that  the  construction  applies  to  other  quadratic  integrands 
(involving  derivatives  of  higher  order,  for  example) . 

We  begin  by  observing  that  l[v]  can  be  expressed  as  a  sum  of  the 
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contributions  from  each  triangle  Tv  c  R  U  &R  .  Thus  we  can  write 


N. 


N. 


(5A)  I[v]  =  f  iV]  «  f  <3^[v]  +  I?  tv]) 
u^l  v=l 

where  I^[v]  has  the  form  (3.1)  with  the  domain  of  integration  replaced 
by  Tv  ,  and  1^  [v]  has  the  form  (3*2)  with  the  contour  of  integration 
SR  replaced  by  Srv  ,  the  part  of  Tv  lying  on  SR  .  For  Tv  with  no 
side  on  Sr  ,  I^[ v]  is  obviously  zero  and  does  not  have  to  be 

considered.  The  basic  procedure  is  to  obtain  expressions  for  each  term 
of  the  summation  (3.4)  as  functions  of  the  parameters  of  v  . 

Consider  first  the  term  Iv[v]  corresponding  to  triangle  Tv  : 

(3-5)  lj[[v]  =  If  (a1vx+a2vxvy+a3vy+a^v2+a5v)dxdy 

Tv 

Recall  from  Section  2  that  our  expression  for  pv(x,y)  on  Tv  could  be 
written  in  the  form 

(3.6)  pv(x,y)  =  qV  Cv  cp  , 

-T 

and  we  observed  in  Section  3*2  that  D  pv  =  qvCv  D  cp  ,  where  the  operator 
DT  operates  on  the  column  vector  cp  term  by  term.  Substituting  (3*6) 
into  (3 .5)>  we  obtain  the  following  expression  for  the  first  four 
(quadratic)  terms  of  (3. 5): 

(5-7)  qV  C v  J  [[  a-jqyp^  +  a^cp*  +  a,cp  q>y  +  a^qxpT  dx  dy  V  C  “  V 


Deferring  treatment  of  the  last  term  in  (3.?)  until  later,  suppose 
Tv  has  cne  or  more  sides  lying  on  6r  and  denote  that  segment  of  5r 


6? 


by  dR  .  Then  we  have 


(3.8)  I^[v]  =  J  (a  6v  +  a^,v)ds 

8rv 


and  again  using  (3*6),  we  obtain  the  following  quadratic  function  from 
the  first  term  in  the  integrand  of  (3.8): 


(3-9) 


T  -Tf 

iv°v  <  I 


T  \i  \s 
a^xpq?  ds  C  q 


We  will  denote  the  sum  of  the  matrices  in  braces  in  (3.7)  and  (3*9) 
by  Hv  .  The  so-called  stiffness  matrix  is  then  given  by 
-T  -1 


(3.10) 


AV  =  CV  HVCV  , 


T 

and  the  quadratic  terms  of  I^[ v]  yield  the  function  qv  Avqv  . 


Turning  now  to  the  linear  terms  in  I^[v]  and  I^[  v]  we  obtain, 


using  exactly  the  same  procedure,  the  expression 


T  -T 


(3.11)  qv  Cv  l  a  a,xp  dx  dy  +  J  a^q>  ds 

'  8rv 


Denoting  the  vector  in  braces  by  wv  ,  the  linear  terms  in 
iV)  =  l£[v]  +  iy  v]  yield 


T  -T  T 

(3.12)  qVCV  wV=qVbV 


where  the  vector  bv  is  usually  referred  to  as  a  load  vector  by  engineers 


Repeating  the  above  proced.  re  for  each  triangle  T  ,  v  =  1,2, . ..,N^ 


we  obtain  finally 


N 

(3.13)  I[v]  =  ^  (qvA  V+qvV)  , 

where  we  note  that  there  will  be  parameters  qY  common  to  more  than  one 

J 

of  the  terms  of  the  summation. 

If  we  assume  all  our  boundary  conditions  are  natural  (i.e.,  they  are 
satisfied  automatically  because  of  the  design  of  the  functional  being 
minimized),  then  (3*3)  is  null,  and  our  approximate  solution  is  obtained 
by  minimizing  (3.1p)  with  respect  to  the  qv,s  .  That  is,  we  satisfy 

WA  f  T  ^ 

(3.14)  £  |^(AV  +  AV  )qv+bv  J  =  0  . 

If  v  must  satisfy  some  boundary  conditions  of  the  form  (3*3),  then 
some  of  the  q^'s  are  constrained  to  assume  certain  values  or  satisfy 
certain  relations.  This  entire  assembly  problem  and  incorporation  of 
boundary  conditions  is  examined  in  the  next  section. 

We  now  examine  the  details  of  implementation  of  the  procedure  outlined 
in  (3.6) -(3 .12; .  To  reduce  the  amount  of  computation  that  must  be  done 
for  each  triangle,  it  is  convenient  to  confine  as  much  of  the  computation 
as  possible  to  a  standard  canonical  triangle  T°  for  which  part  of  the 
computation  can  be  done  once  and  for  all.  The  savings  that  can  be  realized 
depend  rather  heavily  on  whether  the  coefficients  of  the  functional  a^  , 
i  =  1,2, ...,7  are  constants  or  variable.  Tne  following  scheme  has  bean 
described  for  particular  problem- element  combinations  by  Zlamal  [Z5], 

Dupuis  and  Goel  [  D3]  and  others. 

Let  T°  have  vertices  (0,0),  (1,0)  and  (0,1)  .  Then  the  linear 
transformation  mapping  T°  (|-rj  plane)  onto  Tv  (x-y  plane)  having 
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vertices  (x^y^)  i  =  1,2,3  is 


where  =  xi+1~x^  with  the  subscripts  interpreted  modulo  3*  The 
inverse  mapping  is  then 

_  _1_  {  -W3  “3  V  x  Y  _i_  /  'Vl  -  &jxl  \ 

|JV|  n^)\  y  )  ljVl  \AXri  '  tyixi/ 

where  |jvj  is  the  determinant  cf  Jv  .  Note  that 

(3.17)  Jf  w(x,y)dxdy  =  w(x(|),y(n))|jv|  d|dri  . 

Now  define  the  quantities  q  ,  cp  ,  and  C  on  T°  (in  the  £-t)  plane) 
in  exactly  the  same  way  as  their  counterparts  were  defined  in  the  x-y 
plane.  For  example, 

(3.18)  <p(5)  =  (1,£,t),!2,6i,.  It)2,  6^, i“ih £t]2, rf5) 

Using  (3.16)  and  (3-17),  the  integrals  (3. 5)  and  (3.8)  can  be 
expressed  in  the  form 

(3.19)  ff  (gjjw|  +  g2wcWT]+  SjW^  +  gjjw2  +  g^v)  |jv|d?dn  , 

T° 

and 

(3-20')  J  (g^w2+  g^w)  |jv|ds  , 

8t° 
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where  w(|,t])  =  v(x(|,  rj)  ,y(|,  i]))  ,  and  the  g.'  ’s  will  depend  not  only- 
on  their  respective  rs,  but  also  upon  the  mapping  and  the  other  terms 
in  the  functional.  The  contour  integral  (3.20)  is  understood  to  apply 
only  to  the  part  of  T°  corresponding  to  &RV  .  Again  it  is  convenient 
to  collect  the  linear  and  quadratic  terms  together.  Carrying  out  the  above 
procedure  for  the  quadratic  terms  in  (3-19)  and  (3.20)  we  obtain 


(3.21) 


The  expression  in  braces  in  (3-21)  is  the  f--T}  counterpart  of  the 
matrix  Hv  defined  above,  and  we  denote  it  by  HV  .  Then  the  |-t] 
counterpart  of  Av  is  given  by 

(3.22)  Av  =  C"T  Hv  C'1 


The  linear  terms  of  (3-19) -(3-20)  yield 


(3-23)  qT  C"TJ  JJ  g^  cp|jv|  d|dr)  +  Jg^  cp  |jv|d^, 


T 


dr° 


or 


(3.24)  qT  C"T  w  =  qT  b 


Finally,  since  we  wish  Ivlv]  to  be  expressed  in  terms  of  the 
parameters  in  the  x-y  plane  rather  than  the  £-t]  plane,  we  must  apply 
a  transformation  derived  from  (3*15)  Av  and  b  .  Specifically,  using 


(^.15) ,  we  can  easily  construct  a  block  diagonal  matrix  K  satisfying 

(5-25)  q  =  Kvqv  , 

from  which  we  can  get,  by  substitution  of  (5.25)  into  (5.22)  and  (5.24), 
the  following 

(5.26)  AV  =  /  IV  ,  bV  =  T?  bV 

The  following  points  are  Important  in  the  implementation  of  the  above. 

(i)  If  the  coefficients  of  the  quadratic  terms  in  the  functional  are 
constants  (or  at  least  constant  over  each  triangle),  then  the  corresponding 
gv  's  will  be  constant  over  the  triangles.  Thus  Av  can  be  expressed  as 
the  sum  of  matrices  of  the  form  7]_Gp+7gG 2+‘y5G5  +  74G4  *  where  the  Gj,*s 
are  independent  of  v  ,  (and  thus  need  to  be  computed  once),  and  y^  =  y . (v) 
For  example,  the  first  term  would  be 

(5.27)  g£  lJVl  C'T  ff  $g  d|  dj]  C-1 

7x(v)  g-l 

The  generation  of  the  G^’s  can  be  done  very  efficiently  as  follows. 
First  we  compute 

(5.28)  <9...  =  JJ  |V«cLti  =  i*.i '/  (i+j+2)  l  , 

T° 

for  all  i  and  j  less  than  p  ,  where  p  depends  upon  d  and  the  terms 
in  the  functional.  The  components  of  the  integral  are  then  I  ,  where 
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I 


rn  and  are  simple  integer  functions.  When  is  not  constant  over 

each  triangle,  numerical  integration  will  probably  be  necessary  to 
evaluate  the  expressions  in  the  braces  in  (3.21)  and  (5.23) •  Even  in  this 
instance,  having  the  basis  functions  in  the  form  (2.l4)  is  still  very- 
convenient,  since  it  allows  us  to  compute  the  integrand  at  the  evaluation 
points  very  efficiently.  For  example,  consider  evaluating  the  i, j-th 
component  of  the  integrand  of  the  first  bracketed  integral  in  (3*21)  at 
the  point  ( §  ,  tj  )  .  The  function  to  be  evaluated  will  have  the  form 


1,-2  l 

(3.29)  g l 


V 1  VI  *1  lO~ 2 
V  g2  V  V  g3  V  \ 


,  v  r  1  2 

+  %  h  v  > 


Assuming  we  have  the  basis  functions  in  a  convenient  symbolic  form, 
the  evaluation  of  the  integrand  can  be  optimized  considerably  by  precomputing 

V2  V2 

the  common  factor  |  n 

(i  V 


(ii)  The  matrix  C  and  its  UJ  decomposition  need  only  be  computed  once, 
since  C  is  independent  of  v  • 

(iii)  The  computation  done  so  far  has  been  independent  of  the  boundary 
conditions  (5.13).  Thus  a  change  in  them  would  net  require  re-computation 
of  the  Av  and  bv  ,  v  -=  1,2,  ...,N^  .  Also  note  that  changes  in  a^ 


and 


®7 


would  not  change  A,  v  =  1,2, ...,N 


i  \ 


l: 


0 
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(iv)  Consider  the  calculation  represented  by  (3.22),  and  denote 

by  n  .  Normally,  one  would  expect  the  congruence  transformation  to 

3  2 

require  2n  +  0(n  )  multiplicative  operations,  since  we  need  to  perform 

2 

2n  back-sol ves,  each  requiring  n  +  0(n)  operations.  We  will  show  how 

7  3  2  ~\) 

to  reduce  the  computation  to  j-  n  +  0(n  )  under  the  assumption  that  H 

b  „  T 

is  symmetric.  [Equation  (3 •  1*0  above  implies  that  we  only  need  Av+Av  ; 

__T  „  „  T  „_1 

therefore,  if  Hv  is  not  symmetric,  we  can  compute  C  (Hv  +  Hv  )C  ]  . 

The  following  technique  has  also  been  used  in  [M2]  in  connection  with 
solving  generalized  eigenvalue  problems. 

~T 

Suppose  we  have  the  HJ  decomposition  of  C  .  Then  the  basic 
procedure  is 

(a)  Solve  LU  W  =  Hv  , 

(b)  Solve  LU  AV  =  WT  . 

Consider  step  (a) .  Suppose  we  compute  only  the  lower  triangle  of  W 


i.e.,  we  do  not  complete  the  U-solve,  so  that  W  has  the  form 


$ik)  • 


It  is  easy  to  show  that  now  the  calculation  of  W  requires  the  following 
number  of  multiplicative  operations: 

^  n2  ,  £  i(i+l)  r?  ,  x?  ,  2N  2  3  ,  2, 

°a  =  n  '  T  +  ~2  ~  =  T  +  T  T  0(n  }  =  3  n  +  °(n  )  • 


Now  consider  step  (b) .  We  use  the  following  notation  to  indicate  partitions 
T 

of  L  ,  U  and  C  ,  where  the  upper  left  partition  is  k  by  k  : 


I  rk  'X 
'1  I  2  1 


ck  I  ck 
C3  |  4 


r 


uk  1  uk 

U1  I  L2 

I  ,.k 
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We  will  denote  the  i-th  column  of  Av  by  ai  ,  its  first  k  elements 

k  k' 

by  a"  ,  and  its  last  n-k  elements  by  a..  .  The  first  i  elements  of 

the  i-th  row  of  W  will  be  denoted  by  w^  .  Then  step  (b)  can  be  described 

as  follows:  For  k  =  n,n-l,  ...,1  compute 


k  k  k 

L1  U1  \ 


-  w. 


k* 

a, 

K 


The  first  step  yields  the  last  row  and  column  of  A  ;  the  next  step  yields 

tne  remaining  unknown  parts  of  the  (n-l)-st  row  and  column  and  so  on.  Note 

kf 

that  at  each  stage  the  vector  a^  has  already  been  computed  by  previous 

k  k  k 

steps.  Here  we  use  the  fact  that  .  The  number  of  multiplicative 

operations  required  for  step  (b)  is  given  by 


a 


-  Z 

i=l 


n  2  n 
i  + 


k 


i(n-i) 


0(n2) 


Thus*  the  total  computation  required  for  the  congruence  transformation 

has  been  reduced  from  2n^+0(n  )  to  0^4  0^  =  -g  n  +  0(n  ) 

When  the  coefficients  of  the  quadratic  terms  are  constants,  this 

technique  Will  not  be  too  important  since  the  number  of  such  congruence 

transformations  will  be  small.  The  computation  of  the  G  matrices 

discussed  above  is  initialization,  and  for  »  n^  ,  the  work  required 

2 

for  equation  generation  is  essentially  proportional  to  N^n^  .  However, 

if  one  or  more  of  the  quadratic  coefficients  is  variable,  a  congruence 

transJ ormation  must  be  done  for  each  triangle,  and  using  this  technique 

5  3 

saves  ^  n^  multiplicative  operations. 

The  equation  generation  can  be  summarized  as  follows: 
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Step  1  (Initialization) 

(i)  Compute  C  and  its  UJ  decomposition. 

(il)  If  all  the  quadratic  terms  have  constant  coefficients  then 
compute  the  appropriate  G  matrices  and  store  them. 

Step  2 

For  each  triangle  T“  do  the  following: 

(iii)  Compute  the  mapping  from  T°  to  Tv  >  and  generate  the  quantities 
jjvj  and  g^  . 

(iv)  Generate  Av  and  bv  . 

(v)  Apply  the  transformation  KV  to  Av  and  bv  to  obtain  Av 
and  bv  . 


4 .  Assembly  of  the  Equations 


Having  completed  the  procedure  described  in  Section  3-3  for  each 
triangle,  we  have  a  system  of  the  following  form  to  solve: 


Combining  the  terms  in  (4.1),  and  renumbering  the  ls  and  b^  *s  from  1 
to  N  ,  we  obtain  the  system 


(4.2) 


Aq  =  b  . 


As  we  pointed  out  in  the  previous  section,  if  boundary  <  onaitions  of 
the  form  (3-3)  are  imposed,  then  some  of  the  elements  of  q  will  be 
required  to  assume  specific  values  or  satisfy  specific  relations. 

Suppose  first  that  the  boundary  conditions  only  impose  constraints  on 
single  parameters,  rather  than  specifying  relations  that  must  hold  between 
several  parameters.  Partitioning  q  into  q^  and  q2  ,  equation  (4.2) 
can  be  written  in  the  form: 


(^.3) 


A11  A12 


VA21  ^22  J 


Now  if  q2  must  satisfy  q2  =  q0  ,  we  can  solve 


(4.4) 


Allql  bl  “  A12q2 
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As  Felippa  and  Clough  [  Fl]  point  Out,  in  order  to  avoid  rearranging 
equations,  we  would  actually  solve  th^  following  system  in  some  permuted 
form 

m  ODCD-Gf" 

We  shall  see  in  Chapter  4  that  if  we  use  "profile"  methods,  this  practice 
costs  almost  no  storage  or  computation.  We  denote  this  system  by 

(4.6)  A'q*  =  b*  • 

Now  suppose  further  that  the  boundary  conditions  impose  some  general 
linear  constraints  on  the  solution  of  (4,6) .  As  an  example,  we  appeal  to 
our  cubic  element  3-4  and  the  diagram  below: 


»  x 


Figure  4.1 

Suppose  the  boundary  condition  ^  =  g(x,y)  is  imposed  along  dR  . 

Then  at  the  point  Q,  ,  we  want  to  impose  the  condition  (4.7)  on  the  derivative 
parameters  at  the  node  Q,^  . 


(^•7)  "W  Sil1  01  +  Vy(Ql)  C0S  “  =  6  (^0  ' 

If  our  boundary  conditions  impose  l  such  constraints  (where 
l  «  N  in  general),  we  can  write  them  as  an  l  xN  matrix  equation 


(4.8)  Qq'  =  c  . 

The  solution  of  (4.6)  can  be  viewed  as  the  point  which  minimizes  the 
quadratic  function 

(4.9)  $(q‘)  =  |  q,TA'q*  -q‘Tb'  . 

Using  the  method  of  Lagrange  multipliers  to  minimize  (4.9)  subject  to  the 
constraints  (4.8),  we  obtain  the  following  system  to  solve 


(4.10) 


C)  - 


where  X  is  a  vector  of  l  Lagrange  multipliers.  The  algorithm  for 
solving  (4.10)  is 

m 

a)  Solve  A'W  =  Q  and  compute  Y  =  QW  (and  its  IXJ  decomposition) 

b)  Solve  A’y  =  b’ 


(4.11) 


c)  Solve  Y X  =  Qy-c 

d)  Solve  A'q*  =  b'-QT\ 


At  first  sight  this  algorithm  appears  expensive,  since  1+2  solutions 
of  systems  of  the  form  (4.6)  are  involved.  However,  if  the  coefficients 
in  (3.3)  are  constants,  Y  remains  constant  for  different  boundary 
conditions.  Thus,  in  such  circumstances,  our  problem  may  be  solved  using 
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steps  (b),  (c)  and  (d)  once  the  LU  decomposixion  of  Y  is  available. 

Another  possibility  for  handling  boundary  conditions  of  the  form 
(4-7)  is  to  modify  the  parameters  of  our  problem.  Applying  the  proper 
orthogonal  transformation  to  the  stiffness  matrices,  we  rotate  the 
derivative  parameters  to  get  v  and  v  .  The  boundary  condition  then 
imposes  a  condition  only  on  v  ,  and  the  variable  can  be  eliminated  in 
the  obvious  way. 

Which  of  the  two  approaches  for  handling  derivative  boundary 
conditions  is  better?  It  is  fairly  clear  that  the  latter  approach  will 
generally  require  less  computation,  since  the  rotations  which  must  be 
applied  are  relatively  inexpensive  and  each  one  saves  a  solution  of  the 
system  (4.6).  For  very  large  systems,  the  difference  between  the  two 
computations  will  be  great.  Furthermore,  the  relative  difference  between 
the  work  required  to  decompose  A'  and  that  required  for  a  back  solution 
is  not  as  large  for  band  systems  as  for  dense  systems,  since  the  factor 
is  the  bandwidth  rather  than  N  .  Thus  our  remark  above  that  A’  need 
only  be  decomposed  once  is  not  as  important  as  you  would  expect. 

In  supoort  of  the  first  method  of  treating  derivative  boundary 
conditions  is  its  simple  and  uniform  implementation.  The  computation  can 
be  isolated  in  one  subroutine  which  generates  the  matrix  Q,  .  In  contrast, 
the  second  approach  is  very  complicated.  Corners  having  interior  angles 
which  are  not  multiples  of  tt/2  may  force  us  to  apply  non-orthogonal 
transformations  to  the  derivative  parameters  in  order  to  handle  boundary 
conditions  imposed  on  both  incident  edges.  The  fact  remains,  however,  that 
such  complexity  pays  off.  For  typical  problems  (and  a  one  shot  computation) 
the  first  approach  can  require  twice  as  much  computation  as  the  second. 
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5.  Inclusion  of  Singular  Functions  in  the  Basis 

For  some  elliptic  boundary-value  problems,  particularly  in  domains 
with  re-entrant  corners,  the  solutions  may  have  unbounded  derivatives  at 
some  of  the  corners,  so  they  are  hard  to  approximate  by  polynomials.  A 
successful  approach  due  to  Fix  [F2]  is  to  enrich  the  basis  by  adding 
appropriate  "singular"  functions  that  represent  the  solution  accurately 
near  the  corners. 

Fix  employed  tensor  product  spaces  rather  than  the  interpolation  method 
for  generating  the  finite-element  equations  (the  distinction  between  the 
two  approaches  was  made  in  Chapter  3)  •  Thus,  once  he  had  designed  the 
appropriate  singular  functions  having  small  support,  the  inclusion  of 

them  in  the  basis  was  straightforward.  The  extra  terms  were  simply  added 

.  ..  N 

to  tne  expansion  lor  v  . 

The  inclusion  of  such  singular  functions  is  still  possible  with  the 

interpolation  approach,  but  the  procedure  is  not  quite  so  obvious.  Suppose 

* 

we  wish  to  include  one  singular  function  V  in  the  basis,  and  assume 
that  V*  j=  0  on  triangle  Tv  .  We  will  ignore  the  compli.cation  of  the 
mapping  of '  Tv  onto  the  canonical  triangle  T°  .  Using  the  notation  we 
developed  in  Section  1  of  this  chapter,  we  consider  the  computation 
involving  the  following  term  on  triangle  Tv  : 

(5.1)  JJ \dxdy  . 

TV 

We  first  note  that  the  basis  functions  on  Tv  under  "normal"  circumstances 
are  given  by  'Jr)>  ,  i  =  1,£,3, . .  .,ra  ,  where 
-T 

(5.2)  ^  =  {Cv  <p]. 
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The  approximation  to  u  on  Tv  is  thus  given  by 


m  T  -T 

(5-3)  v(x,y)  =  £  qJJ  =  qV  CV  <p 
i=l  1 


In  this  form  it  is  clear  how  to  add  the  singular  function.  Including  the 
singular  function  in  the  sum  of  (5*3)  and  going  in  reverse  we  have: 


(5.U)  v(x,y)  =  £  qj*  +  q* 
i-1  1  1 


The  expression  for  (5«l)  is  therefore 


(5.5) 


/  T  , 

^x'V  ^dy 


In  this  particular  example,  the  stiffness  matrix  for  Tv  will  be  (n+l) 
by  (n+l)  rather  than  n  by  n  .  The  extension  to  more  than  one  singular 
function  is  clear. 
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CHAPTER  k 

SOLUTION  OF  FINITE  ELEMENT  EQUATIONS 
1 .  Introduction  and  Notation 

In  thin  chapter  we  will  study  the  storage  solution  of  finite  element 
systems  of  equations.  As  we  pointed  out  in  Chapter  I,  the  NxN  finite 
element  coefficient  matrix  A  will  in  general  he  sparse;  that  is,  many 
(perhaps  most)  of  its  elements  will  he  zero.  To  say  that  a  matrix  is  sparse, 
with  no  further  qualification,  is  not  of  much  practical  significance.  What 
is  important  is  whether  we  can  make  use  of  its  sparseness  to  reduce  storage 
and/or  computation  requirements  in  its  subsequent  processing;  that  is,  we  are 
interested  in  whether  the  matrix  has  exploitable  structure  rather  than  .just 
its  sparseness.  One  of  our  aims  in  this  chapter  will  be  to  study  the  structure 
of  finite  element  equations  and  to  show  how  such  structure  can  be  utilized. 

In  this  connection  we  present  some  experiments  comparing  several  ordering 
algorithms  (i.e.,  algorithms  which  order  or  reorder  the  rows  and  columns 
of  A  with  the  aim  of  reducing  storage  and  computation  requirements).  We 
also  present  two  efficient  methods  for  storing  sparse  matrices. 

We  have  confined  our  attention  to  direct  methods  for  solving  finite 
element  equations  for  the  following  reasons: 

(l)  Storage  is  becoming  increasingly  abundant,  and  one  of  the  prime  reasons 

for  using  iterative  methods  is  that  they  generally  require  much  less  storage 

than  direct  methods.  Computer  memories  are  steadily  becoming  larger,  the 

capacity  and  performance  of  peripheral  storage  devices  such  as  disks  and 

drums  is  improving  rapidly,  and  large  bulk  core  storage  [f8]  (which  can  be 

viewed  as  a  very  fast  peripheral  storage  device)  is  becoming  common.  The 

use  of  virtual  memory  [ DI ,  M5 ]  is  another  important  development .  Under  ideal 

2k 

conditions,  the  user  is  allowed  to  address  a  very  large  memory  («  2  words 
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on  the  IBM  360/67)  which  need  not  exist  physically  but  where  addresses 
are  automatically  mapped  onto  actual  physical  addresses  during  execution. 

We  do  not  mean  to  imply  that  storage  is  not  an  important  consideration  in 
the  choice  of  methods;  our  contention  is  simply  that  the  characteristics  of 
today's  computer  systems  allow  the  solution  of  large  linear  systems  with 
direct  methods. 

(2)  Finite  element  methods  tend  to  yield  denser  systems  of  equations  than 
usual  finite  difference  methods.  Suppose  the  parameter  q^  is  associated 
with  node  i  .  Then  there  will  usually  be  a  non-zero  entry  in  q.  ,'s  equation 
for  every  parameter  associated  with  every  triangle  containing  node  i  .  It 
is  easy  to  see  that  higner  degree  polynomials  must  lead  to  denser  systems, 
because  more  parameters  will  be  associated  with  each  triangle.  We  discuss 
this  subject  in  detail  in  Section  6  of  this  chapter.  Since  the  amount  of 
computation  per  iteration  for  most  iterative  schemes  is  proportional  to  the 
number  of  non-zero  elements  in  the  matrix,  this  increased  density  increases 
the  solution  time  for  iterative  methods.  [However,  for  fixed  N  , 
higher  degree  polynomials  yield  systems  which  require  more  computation 
for  their  direct  solution  also,  so  it  is  difficult  to  make  precise 
statements  as  to  which  methods  require  the  least  computation.]  Fix  and 
Larsen  [ F3 1  have  compared  Gaussian  elimination  and  successive  over¬ 
relaxation  (DOR)  for  some  special  tensor-product  spaces,  and  their  analysis 
and  numerical  experiments  suggest  that  SOR  is  more  efficient  for  some 
problems,  if  N  is  large  enough.  Their  conclusions  are  based  on  the 
assumption  that  the  equations  have  only  one  right  side,  and  in  many  practical 
situations,  this  is  unlikely.  Also,  their  analysis  is  based  solely  on 
operation  counts.  For  tensor-product  bases  such  an  analysis  is  reasonable, 
since  the  structure  of  the  grid  and  the  coefficient  matrix  can  conveniently 
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be  stored  in  two- dimensional  arrays.  The  data  management  is  no  more  complex 
than  "Chat  resulting  from  using  a  five  point  difference  operator  on  a  regular 
mesh.  However,  for  an  arbitrary  triangular  mesh,  A  will  not  have  such 
regular  structure,  and  the  calculation  of  a  single  component  of  the 
residual  vector  may  be  relatively  expensive.  In  general,  A  will  be 
symmetric  and  only  it’s  upper  or  lower  triangle  will  be  stored;  therefore, 
in  order  to  compute  a  single  component  of  the  residual,  we  must  be  able 
to  access  lines  of  elements  in  both  rows  and  columns  of  the  upper  (or  lower) 
triangle  of  A  .  If  the  storage  scheme  is  "row  oriented",  accessing  elements 
in  a  specific  column  may  require  scanning  several  rows,  and  visa  versa  for 
column-criented  schemes.  By  contrast,  elimination  schemes  can  be  conveniently 
implemented  so  that  they  operate  only  on  rows  or  only  on  columns.  We  discuss 
this  subject  in  detail  later;  our  point  is  that  data  management  can  be 
important  in  comparing  methods. 

(3)  Finally,  and  perhaps  most  important,  a  rather  large  amount  of  practical 
engineering  experience  indicates  that  direct  methods  are  preferable  to 
iterative  ones.  The  reasons  for  this  include: 

(i)  Finite  element  systems  (designed  to  yield  a  prescribed  accuracy) 
tend  to  have  a  considerably  lower  order  N  than  systems  resulting  from 
usual  finite  difference  methods.  This  is  due  in  part  to  the  ease  with 
which  we  can  grade  the  net  (thus  making  efficient  use  of  each  degree 

of  freedom) .  Also,  as  we  shall  see  in  Chapter  increasing  the  degree 
d  of  our  piecewise  polynomial  allows  us  to  decrease  K  and  still 
obtain  the  prescribed  accuracy. 

(ii)  Direct  methods  allow  the  use  of  iterative  refinement  [F4,  W2], 
which  provides  an  estimate  of  the  condition  of  the  discrete  problem  and 
the  accuracy  of  the  discrete  solution.  Such  information  is  hard  to 


obtain  using  iterative  methods.  Since  we  do  not  know  the  true  (discrete) 
solution,  the  error  at  each  step  of  the  iteration  must  be  estimated  on 
the  basis  of  such  measurable  quantities  as  the  size  of  the  residuals  or 
the  size  of  the  last  correction  vector.  Unfortunately,  small  residuals 
or  small  changes  in  successive  iterates  do  not  guarantee  small  errors  in 
the  computed  solution.  By  using  direct  methods,  we  also  avoid  the 
problem  of  finding  a  "good"  over-relaxation  parameter. 

(iii)  Usually,  more  than  one  right  side  must  be  processed.  The  initial 
cost  of  the  decomposition,  which  represents  the  majority  of  the  computation 
for  the  first  solution,  does  not  have  to  be  repeated  for  succeeding 
right  sides. 

The  study  of  sparse  matrix  problems  is  a  rapidly  expanding  field. 

(See  Willoughby  [W3],  and  the  extensive  references  therein.)  In  the  sequel, 

we  will  assume  A  is  a  symmetric  positive  definite  matrix.  As  we  observed 

in  Chapter  I,  finite  element  methods  for  elliptic  problems  cast  in  a 

variational  form  yield  this  type  of  matrix.  Following  Rose  [R3]  and  Cuthill 

and  McKee  [C5],  A  will  be  said  to  have  bandwidth  m  if  and  only  if 

a_  f  0  =*.|i-j|  <  m  .  Note  that  this  differs  from  the  usual  definition  of 

bandwidth,  which  is  defined  in  terms  of  m  to  be  2m+l  .  For  any  matrix  W  , 

we  define  the  quantities  ff  =  min(j|w.  f  0}  ,  i  -  1,2, ...,N  ,  and 

6^  =  i-f.  .  Thus,  m  =  maxfb^}  .  The  number  N^  will  denote  the  number 
11  ^1  Z 

of  non- zero  elements  in  W  . 

Rose  [R3]  has  given  a  detailed  graph  theoretic  analysis  of  the  Cholesky 
decomposition  algorithm.  With  Rose  we  define  the  graph  G  =  (X,E) 
associated  with  A  ,  where  X  and  E  are  sets  of  nodes  and  edges, 
respectively.  Vertices  correspond  to  rows  of  the  matrix,  and  edges 
correspond  to  non -zero,  off-diagonal  elements  of  A  .  If  i  >  j  and 


86 


a. .  /  0  ,  then  vertex  i  is  joined  to  vertex  j  by  an  edge.  (We 
A  J 

then  say  that  vertices  i  and  j  are  adjacent.)  The 
degree  of  a  vertex  is  the  number  of  edges  incident  to  it.  An  example 
demonstrating  this  notation  appears  below.  An  "X"  indicates  a  non-zero 
element,  and  a  ,!0"  indicates  a  zero  element. 


(1.1)  A  4 


/h  0  X  X 
0  X  X 
X  X  X  0  X  0 
X  0  0  X  0  0 
0  X  X  0  X  X 
0  X  0  0  X  X 


v; 


X  0  o\ 
0  X  X  \ 


■ 


The  ordering  of  the  equations  induces  a  corresponding  ordering  of  the 

vertices  of  G  .  In  general,  we  denote  an  ordering  a  on  X 
(X  ex 

( [1,2,3?  ••  -,N}  -•  X)  by  G  .  Denoting  the  set  of  vertices  adjacent  to 

vertex  i  by  7^  ("neighbours"  of  vertex  i)  ,  we  can  describe  the 

T 

Cholesky  decomposition  of  A  into  LL  by  a  sequence  of  elimination 
graphs  [R3 ]  G  =  GQ,G^,G2, . .  .,GfI_1  ,  where  G^  is  obtained  from 

Gi  1  ky  deleting  vertex  i  and  its  incident  edges  and  adding  edges  so  that 
the  vertices  of  7?^  are  pairwise  adjacent.  Using  our  example  above, 
we  have: 


v  ©— ©  .  v  ©  • 

The  zero/non-zero  structure  of  L  is  thus  given  by 
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A 

x 

X  X 
X  X 

X  X  X  X 
X  X  X  X  X  J 

The  munber  of  edges  added  during  elimination  is  usually  referred  to  as 
the  fill-in,  and  is  simply  the  difference  oetween  and  the  number  of 

non-zero  elements  in  the  lower  triangle  of  A  ,  including  the  diagonal. 

Rose  [R3]  points  out  that  the  fill-in  will  be  zero  iff  for  all 
N>i>j>k>0,  (a„  /  0  A  aik  f  0)  =>  a..^  /  0  .  He  shows  that  L  must 
have  this  property  (if  we  ignore  the  occurrence  of  accidental  zeros),  and 
calls  matrices  having  this  property  perfect  elimination  matrices. 

An  element  a. .  ,  i  >  j  will  be  said  to  lie  in  the  profile  of  A 
(a.,  ePr(A))  if  fj  <  j  <  i  .  Hence  a.  .  f  0  =*  a.  .  e  Pr(A)  ,  but 

1  1J 

a. .  e Pr(A)  a.  .  /  0  .  This  is  a  simple  but  important  generalization  of  the 
!<3 

concept  of  bandwidth.  Observe  that  Pr(A)  =  Pr(L)  .  We  will  denote  the 

number  of  elements  in  Pr(A)  by  {Pr(A) |  .  Thus  A  is  sparse  if 

|Pr(A)  j  is  significantly  less  than  N  ,  even  if  m  =  N  .  Obviously, 

N 

|Pr(A)  j  =  £  (&i  +  1)  • 

L=1 

T 

Now  the  decomposition  of  A  into  LL  is  unique j  however,  the  amount 
of  computation  done  to  obtain  L  will  depend  on  the  structure  of  A  ,  and 
how  carefully  we  take  advantage  of  it.  Suppose  A  is  NxN  with  bandwidth  m 
Then  treating  A  as  a  dense  band  matrix,  it  is  easy  to  show  that  the  number 
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of  multiplicative  operations  required  to  compute  L  is  approximately 

.  We  will  refer  to  the  algorithm  as  the  "band 

Cholesky  (BC)  decomposition  algorithm". 

Suppose  now  that  <  m  for  at  least  one  i  ,  and  we  take  advantage 
of  this  fact.  The  following  theorem  gives  the  number  9p  of  multiplicative 
operations  required  to  compute  L  ,  if  we  consider  A  and  L  as  having 
dense  profiles . 

Theorem  1.1 

Then  the  number  9p  of  multiplicative 
is  given  by 


In  addition,  N  square  root  operations  and  9p-N  additions  are  required. 


Let  f^  be  as  defined  above, 
operations  required  to  compute  L 


(1-3) 


N  &*(8*+3) 

9P  =  C  - 

r  i=2  * 


Proof; 

Let  us  denote  the  elements  of  L  by  £_  and  consider  the  computation 
of  the  i-th  column  of  L  .  The  element  £ . .  is  computed  using  the  formula 


(1.4) 


which  requires  .5^  = 
root  operation.  The 


1-1 

[a. .  -  YZ 
11  *-1  1.3 


2  -,1/2 


i  -  f^  multiplications,  5^  additions  and  a  square 
elements  ,  k  =  f^,f^+l, . . .,  i-1,  ,  are  computed  using 

IK  X  X 


(1-5) 


£. 
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1J 


kj 3 '  xi 


89 


which  requires  6^(8.+l)/2  multiplicative  operations  and  S^(&^-l)/2 
additions.  Summing  over  i  yields  (1.3).  This  method  will  he 
referred  to  as  the  "profile  Cholesky  (PC)  decomposition  algorithm". 
The  following  is  obvious: 


Proposition  1.1 

For  any  ordering  of  A  ,  we  have  9^  <  9^  . 


Finally,  suppose  we  are  prepared  to  take  advantage  of  every  non-zero 
element  in  A  and  L  ;  that  is,  we  will  operate  only  on  those  elements 
which  are  actually  changed  by  the  elimination  process.  Let  d_.  be  the 
degree  of  the  i-th  vertex  in  the  elimination  graph  1  .  Then  we  have 


Theorem  1.2  (Rose  [R3]) 

The  number  of  multiplicative  operations  9&  required  to  compute  L 
is  given  by 


(1.6) 


N-l 

i=l 


d. 


l 


(d.+3) 

_ 


N-l  d.  (d.+l) 

An  additional  N  square  root  operations  and  . ~2 - -  addition 

i=l 

operations  are  required. 

The  reader  is  referred  to  [R3]  for  the  proof  of  (1.6).  This 
algorithm  will  be  referred  to  as  the  "graph  Cholesky  (GC)  decomposition 
algorithm" . 

Now  we  must  consider  the  tradeoff  between  the  amount  of  computation  and 
storage  required  by  the  different  algorithms  and  their  relative  complexity. 
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Note  that  the  graph  theoretic  analysis  of  elimination  implicitly  assumes  that 
we  are  prepared  to  take  full  advantage  of  the  structure  of  A  ;  thus,  for 
these  results  to  be  relevant,  we  must  employ  a  very  sophisticated  program, 
such  as  that  of  Gustafson  et  al  [ G3 ] *  [In  our  6  by  6  example  above,  we 
must  detect  and  make  use  of  the  fact  that  =  0  .  ]  Hence,  for  the  GC 

algorithm  to  be  worthwhile,  L  must  have  a  significant  number  of  zero 
elements  within  its  profile,  and  it  has  been  our  experience  that  the  L’s 
derived  from  finite  element  coefficient  matrices  do  not  have  sparse  profiles. 
(See  Section  for  some  numerical  experiments  in  support  of  this  claim.) 
Therefore,  we  have  confined  our  studies  to  the  BC  and  PC  algorithms.  We 
should  emphasize  that  our  decision  is  based  only  on  empirical  evidence; 
just  how  dense  Pr(L)  must  be  over  all  orderings  appears  to  be  an  open 
question,  even  for  piecewise  linear  polynomials  on  a  square  regular  right 
triangular  mesh. 

So,  in  summary,  we  have  chosen  for  various  reasons  to  limit  our 
attention  to  direct  methods  for  solving  finite  element  systems,  and  to 
look  at  no  more  of  the  structure  of  the  matrix  than  its  profile. 

Within  this  framework,  our  goals  are  to  reduce  storage,  reduce  computation, 
and  to  simplify  data  management.  These  goals  compete  with  one  another,  and 
the  characteristics  of  the  particular  computer  system  (hardware  and  software) 
will  have  considerable  effect  on  which  is  most  important. 

Finally,  in  the  sequel,  the  reader  should  keep  in  mind  that  f^  ,  6^  , 

Pr(A)  ,  9  ,  9  and  9  are  all  functions  of  the  ordering  a  of  A  . 

JT  vjr 

Thus  comparisons  between  such  quantities  should  be  understood  to  mean 
for  the  same  a  ,  unless  specifically  stated  otherwise. 
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2. 


As  in  the  previous  section,  let  us  denote  our  sparse,  symmetric, 
positive  definite  coefficient  matrix  by  A  ,  with  Cholesky  factorization 
LL"  •  When  piecewise  polynomials  of  degree  >  1  are  used,  the  matrix  A 
will  be  more  dense  than  that  resulting  from  usual  finite  differerce  schemes , 
Unfortunately,  its  profile  is  observed  to  become  only  slightly  more  dense 
with  increasing  degree.  Hence  it  is  advantageous  to  store  the  matrix  in  a 
compact  manner  to  save  storage.  It  is  important  to  keep  the  organization 
simple  to  allow  rapid  row  and/or  column  operations  on  the  matrix.  The 
prime  consideration  is  not  whether  we  can  randomly  access  a  particular 
element  of  the  matrix  efficiently  but  whether  we  can  efficiently  multiply 
the  matrix  by  a  vector  or  multiply  one  of  its  rows  by  a  vector. 

As  we  have  mentioned  before,  finite  element  coefficient  matrices  tend 
to  have  a  good  deal  less  uniformity  in  structure  than  those  arising  from 
traditional  finite  difference  methods.  Because  of  the  likelihood  of  graded 
nets  and  the  possibility  of  associating  more  than  one  parameter  with  each 
grid  point,  it  is  not  convenient  to  design  a  storage  scheme  based  on  the 
geometry  of  the  mesh  in  question.  This  is  in  contrast  with  most  storage 
schemes  for  difference  equations. 

Ideally,  the  number  of  storage  units  required  to  store  the  N  X  h 

symmetric  coefficient  matrix  A  should  be  equal  to  ,  the  number  of 

non- zero  elements  in  the  lower  triangle  of  A  (including  the  diagonal). 

While  it  is  obviously  possible  to  store  A  in  storage  locations,  the 

problem  is  to  find  an  efficient  mapping  function  that  allows  us  to  easily 

locate  element  a. .  .  In  this  section  we  describe  two  methods  for  efficiently 
1D 

storing  a  sparse  symmetric  matrix. 
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Method  1.  Let  v  be  a  vector  defined  by 
i 

(J.l)  V  =  ^  £  0}  ,  i  =  1,2,  ...,N  . 

j=l 

N 

Obviously,  Y  =  *  Let  f3 .  be  defined  by 

."n  X  V  X 

1=1 

i 

(5*2)  p.  =  ,  i  =  1,2,  ...,N  . 


The  non-zero  elements  of  the  i-th  row  of  the  lower  triangle  of  A  are  then 
stored  in  contiguous  locations  of  an  array  S  of  length  NQ  beginning  at 
S„  ™  and  ending  at  S,,  .  In  an  array  u)  ,  also  of  length  WA  ,  the 

h-i' 1  h  0 

corresponding  distances  of  the  elements  from  the  diagonal  are  placed.  Hence, 

if  6.  -  <  p  <  8.  ,  then  S  contains  element  a.  .  .An  example  is 
*1-1  -  Ki  p  i-u> 

useful  in  understanding  the  scheme.  Consider  the  following  15x15  matrix. 


6  4 

8  9 

1  4  6 

9  12  SYMMETRIC 

^  8  12 

2  4 

2  5  5 

16  8 

8  9  9 

L  7 

9  62 

6  8 

5  2  2 

7  b  L 
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Here  =  38  and  the  vectors  S 


and  uj  are  given  by 


8 

0 

6 

1 

4 

0 

8 

1 

9 

0 

1 

2 

4 

1 

6 

0 

9 

,  (D  = 

3 

12 

0 

4 

5 

8 

3 

1 

1 

2 

0 

• 

* 

• 

At  first  glance,  the  overhead  for  this  method  appears  prohibitively  high 
since  each  word  stored  requires  an  extra  word  to  store  its  "offset"  from 
the  diagonal.  However,  note  that  the  elements  of  ta  will  all  be  bounded 

by  m  ,  the  bandwidth  of  A  .  On  the  IBM  360,  for  example,  the  array  iu 

can  be  declared  as  short  integer  (2  bytes  =  16  bits),  whereas  the  elements 
of  S  may  be  4  or  8  bytes  long.  If  A  is  being  stored  in  double  precision, 
the  overhead  is  only  about  25  percent,  and  the  total  storage  required  is 
essentially  proportional  to  NQ  .  To  access  a  particular  element  a_ 
of  A  will  require  scanning  ^  -  p..  _1  elements  of  the  array  u>  .  Since 

the  elements  uu,  for  p.  ,  <  k  <  p.  are  ordered,  a  binary  search  can  be 

used,  so  the  amount  of  work  required  to  access  element  a.  .  would  be 

X  J 

proportional  to  log2(p^-p^  .  Even  for  rather  dense  bands  (resulting 
from  use  of  polynomials  of  high  degree),  this  is  very  satisfactory.  For 
example,  using  quintic  polynomials  on  a  typical  mesh,  we  would  need  to 
access  about  4  elements  of  u>  before  finding  a.  .  . 

X  J 

If  storage  is  very  scarce,  a  somewhat  more  efficient  scheme  is  the 
following : 
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Method  2.  Let  A  and  £  be  as  described  above,  and  define  the 


A 

vector  &  as  in  Section  4.1.  Let  A  be  stored  in  the  array  S  as  in 


Method  1,  but  instead  of  defining  the  array  uj  as  in  Method  1,  let 

N 

rfl 

(&.  +1)  .  Define  the  vector  p  by 
i-1  1 


(2-5) 


^  =  £  (s-‘+  ^  »  i  =  1,2,... ,N 

j=l 


Now  define  cu  by 


(2.6) 


4>. 


1  lf  ai,i~£  ^  °  V  .  _A 


0  if  a.  .  =  0 

i,i-£ 


,  l  <  5?  ,  i  =  1,2,  ...,N 


We  again  use  the  example  (2.3)  to  aid  in  understanding  the  scheme. 
The  arrays  p  and  u>  are  given  by 

pT  =  (1,  3,  5,  8,  12,  18,  21,  26,  30,  36,  1*0,  46,  50,  56,  60),  and 

0JT  =  (1111111.1100 110 1QUU-Q 31 0Q]_11101100011I£i011000111001100011101l)  . 

row  12  3  4  5  6  78  9  10  11  12  13  14  15 


Thus,  the  zero/non-zero  structure  of  the  i-th  row  of  the  lower  triangle 
of  A  is  stored  in  the  segment  of  m  beginning  at  o  ..  and  ending 

h-l+  1 

at  (u  .  The  storage  required  to  store  A  is  thus 
"i 


(2.7) 


(Nq  +  2N)  words  +  p^  bits. 


Note  that  the  storage  required  for  p  and  (3  becomes  less  significant 
with  increasing  N^  and  fixed  N  .  The  use  of  a  bit  array  may  cause  some 
program  overhead  (unless  the  machine  is  bit  addressable),  and  since  <u  is 
not  ordered,  up  to  p^  -  p^  ^  elements  of  10  will  have  to  be  examined  to 
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retrieve  element  a..  .  (Note  that  u.  -  u.  ,  maybe  »  6.  -8.  ,  .) 
Although  this  method  will  undoubtedly  require  considerably  more  program 
overhead  than  method  1  to  use ,  it  uses  extremely  little  unnecessary  storage. 
For  example,  using  this  method  on  an  IBM  360  computer  to  store  a  dense 
500  \  500  symmetric  matrix  in  short  precision  requires  less  than  4  percent 
moe  storage  than  the  usual  method  of  storing  a  triangular  matrix  in  a 
one-dimensional  array.  The  percentage  overhead  would  be  halved  if  the  array 
were  being  stored  in  double  precision. 

Thirdly,  we  present  a  method  due  to  Jennings  [jl]  which  is  applicable 
when  |Pr(W) |  «  j  that  is,  when  there  are  few  zero  elements  within 
Pr(W)  .  As  we  mentioned  before,  it  has  been  our  experience  that  the  L*s 
derived  from  finite  element  coefficient  matrices  have  this  property. 


Method  3  ("Profile  Storage  Scheme") 

Let  the  lower  triangle  (including  the  diagonal)  of  A  be  stored 

row  by  row  in  contiguous  locations  of  a  one-dimensional  array  S  .  Defining 

the  vector  p  as  in  method  2  above,  then  element  a. .  ,  i  >  j  is  given 

r  j 

by  S  ,  where  p  =  p.  -i  +  j  .  [Note  that  S  now  has  the  same  zero/non-zero 

P  ^ 

configuration  as  in  method  2.] 

This  method  obviously  applies  equally  well  to  storing  lower  triangular 
matrices,  and  it  is  primarily  for  this  reason  that  we  present  it.  The 
overhead  for  this  storage  scheme  is  only  the  storage  required  for  p  .  To 
store  L  ,  we  need  jPr(L)  j  +  N  words.  If  A  is  stored  in  this  manner, 

the  PC  algorithm  can  be  applied  "in  place"  and  no  temporary  storage  is 
necessary. 

Finally,  we  mention  the  most  commonly  used  method  for  storing  band 
matrices  [Ml],  which  we  will  refer  to  as  the  "diagonal  storage  scheme"  or 
simply  as  method  4.  The  diagram  below  describes  the  storage  layout: 
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The  required  storage  is  (m+l)N  ,  and  in  order  for  the  method  to  be 
efficient  we  should  have  m  «  N  . 

The  following  observation  is  of  practical  interest: 


Proposition  2.1 

Let  ^t,  -  (m+l)N  and  V  =  |Pr(L)j  +  N.  Then  for  any  ordering 

N 

of  A,  Vp  <  VB  +  N,  and  if  £  (m-5^  >  N,  then  Vp  <  . 


Proof: 

N 

Vp  =  N  +  iPr(L)!  =  N  +  £  (6.+1) 

i=l  1 

N 

=  N  +  T  5.  +  N 

iti  1 

N 

=  ( N+l )ra  +  N  -  £  (m-S.  ) 
i=l  X 

N 

=  VB  +  N  -  £  (m-6.)  . 
i=l 
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Thus,  Proposition  2.1  says  that  for  any  ordering,  the  storage 
required  for  method  3  cannot  exceed  that  required  for  method  4  by  more 
than  N  words.  In  practical  situations  we  have  found  that  Vp  is 
always  considerably  less  than  V  .  See,  for  example,  the  experiments 
in  section  4.5  and  the  analysis  in  section  4.7. 

Note  that  there  is  only  a  very  weak  relationship  between  |Pr(A)| 
and  m.  All  we  can  show  is 

(2.9)  N  +  m  <  |Pr(A)j  <  (m+l)N  -  m(m+l)  /  2  . 

Essentially,  (2.9)  says  that  for  a  fixed  m,  |Pr(A)|  can  vary  by  nearly 
a  factor  of  N. 
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3 •  Node  Ordering  for  a  Small  Bandwidth 


The  reasons  most  often  presented  for  reducing  the  bandwidth  of  a 
matrix  are  to  reduce  the  storage  and  computation  required  to  solve  the 
associated  linear  system  or  eigenproblem.  However,  these  reasons  are  valid 
only  if  we  plan  to  store  and  process  the  matrix  as  a  dense  band  matrix.  In 
view  of  Prop.  1.1,  Prop.  2.1  and  (2.9),  the  only  justification  for  ordering 
to  achieve  a  small  bandwidth  is  to  simplify  data  management.  In  this 
section  we  discuss  the  reasons  for  bandwidth  reduction  and  present  some 
algorithms  for  obtaining  small  bandwidth  orderings.  Note  that  the  question 
here  is  not  whether  we  should  use  the  BC  or  PI  algorithms  for  a  fixed 
ordering  a  ,  but  rather,  when  we  should  use  m  (instead  of  |Pr(A)  j 
or  9p)  as  a  criterion  (objective  function)  to  minimize  over  all  orderings 
a  of  A  . 

To  begin  with,  regardless  of  the  ordering  a  of  A  ,  if  A  is  symmetric 
and  positive  definite,  there  seems  to  be  no  reason  to  use  the  BC  rather  than 
the  PC  algorithm.  We  say  this  because  VU  <  (usually).  0  <  Q  ,  and  the 
computational  overhead  of  the  PC  over  the  BC  algorithm  is  negligible. 

However,  the  linear  system  we  want  to  solve  may  not  always  be  positive 
definite;  although  elliptic  problems  will  yield  positive  definite  matrices, 
many  methods  for  solving  the  associated  eigenproblem  involve  shifts  of 
origin  which  destroy  the  positive  definiteness  of  the  system  being  solved  [W2  ] . 
When  A  is  indefinite,  partial  pivoting  is  required  to  maintain  numerical 
stability,  and  the  profile  storage  scheme  is  no  longer  applicable  since  we 
are  now  computing  PA  =  HJ  for  some  (a-priori)  unknown  N  by  N  permutation 
matrix  p  .  The  only  storage  scheme  which  is  well  adapted  to  partial 
pivoting  is  method  4.  For  an  Algol  procedure  for  computing  the  LU 
decomposition  of  indefinite  band  matrices,  see  [Ml].  Thus,  in  this 
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A 

situation  in  =  in  is  important,  since  we  can  only  guarantee  that 
PA  A 

m  <  2m  ,  and  the  combined  storage  requirement  for  L  and  U  (using 

the  diagonal  storage  schene  of  Section  h.2)  is  therefore  (3ntf-l)N  . 

The  work  of  Bunch  [B13]  on  stable  decompositions  of  symmetric  indefinite 

systems  may  be  important  in  this  regard  since  a  shift  of  origin  does  not 

T  T 

destroy  symmetry.  We  compute  PAP  =  LDL  ,  where  D  is  block  diagonal 

with  1  by  1  and  2  by  2  blocks.  Unfortunately,  there  does  not  appear 

T 

PAP 

to  be  any  way  to  bound  m  a-priori.  Thus,  to  be  competitive  (with 
the  band  HJ  algorithm)  with  respect  to  storage  [computation]  we  should 
have  m  >n/6  (m  >N/2/3]  . 

Another  situation  in  which  we  might  wish  to  have  a  small  bandwidth  is 

when  auxiliary  storage  must  be  used.  Overlay  versions  of  band  decomposition 

2 

algorithms  can  be  implemented  most  efficiently  if  km  (1  <  k  <  3)  storage 
units  of  main  memory  are  available.  Hence,  it  is  important  to  have  m 
small.  Note  that  this  does  not  preclude  the  possibility  of  using  the  PC 
algorithm,  if  applicable.  Having  ra  small  simply  limits  the  number  of 
rows  or  columns  we  should  have  available  at  any  given  time. 

If  a  matrix  A  can  be  stored  in  such  a  way  that  only  its  non-zero 
elements  need  to  be  stored  and  considered  in  a  residual  calculation,  it  is 
clear  that  bandwidth  ordering  makes  no  sense  for  iterative  schemes  that 
require  only  a  residual  calculation. 

Obviously,  a  useful  bandwidth  reducer  must  consume  less  time  than  it 
saves  the  linear  equation  solver,  or  else  significantly  reduce  the  amount 
of  storage  required.  It  will  be  relatively  unimportant  in  practice  whether 
the  minimum  bandwidth  is  achieved,  but  we  should  get  reasonably  close  to  the 
minimum  bandwidth  in  an  economical  amount  of  time.  Note  that  an  easily 
ascertained  lower  bound  for  the  bandwidth  (not  necessarily  attainable)  can 


100 


be  obtained  by  finding  f”k/2_|  ,  where  k  is  the  largest  number  of 
non-zero  elements  in  any  row. 

Two  basic  approaches  to  ordering  for  a  small  bandwidth  are  in  current 
use.  They  can  be  classed  as  direct  (or  one-pass)  and  iterative.  The 
direct  schemes  [R3 , 05]  usually  work  closely  with  the  associated  graph, 
and  proceed  by  successively  removing  (i.e.,  numbering)  the  nodes  of  the 
graph  according  to  some  strategy  based  on  the  (usually  local)  structure 
of  the  graph.  The  iterative  schemes,  on  the  other  hand,  assume  a  given 
ordering  and  attempt  to  improve  the  ordering  (again  according  to  some 
strategy)  by  finding  appropriate  row  and/or  column  interchanges.  Since 
the  direct  methods  only  need  a  single  starting  node  to  begin,  while  the 
iterative  schemes  need  an  initial  ordering,  a  reasonable  procedure  is  to 
use  a  direct  method  to  obtain  an  initial  ordering  and  then  use  an  iterative 
scheme  to  improve  it.  The  problem  of  finding  an  initial  starting  node 
is  discussed  in  Section  5. 

We  now  describe  two  popular  direct  methods  for  bandwidth  ordering. 

A.  Spanning  Tree  Method  (Cuthill  and  McKee  [05]). 

1.  Choose  a  starting  node  x^  ,  and  define  Q  =  {x^}  . 

2.  For  each  nou.e  in  Q,  (in  the  order  in  which  they  are 
numbered) ,  number  their  unnumbered  neighbors  in  order  of 
increasing  degree . 

3.  Set  Q  =  [nodes  assigned  numbers  in  the  last  execution  of 
Step  2} . 

4.  If  |q|  =  0  ,  then  stop;  otherwise  go  to  Step  2. 


The  algorithm  is  equivalent  to  finding  a  spanning  tree  (rooted  at  the 
initial  node)  of  the  graph  G  ,  hence  the  name.  [A  tree  is  a  connected 
graph  with  N  nodes  and  N-l  edges.  A  spanning  tree  of  the  graph  G  is 
a  subgraph  of  G  which  is  a  tree  and  contains  all  N  nodes.] 

The  obvious  advantage  of  this  method  is  that  it  is  very  efficient. 

The  required  work  is  proportional  to  N  times  the  average  degree  of  the 
vertices,  and  thus  only  increases  linearly  with  N  .  Very  good  results  are 
obtained,  provided  a  good  starting  node  is  selected. 

The  minimum  degree  algorithm  [ R3 ]  is  similar  to  method  A  above 
and  is  as  follows  5 

B.  Minimum  Degree  Algorithm 

1.  Set  i  =  1  . 

2.  In  the  elimination  graph  G.^  1  choose  to  be  any 
vertex  satisfying 

\7l(x.)\  =  min  |7t(y)l 
^Xi-1 


where  G.  ,  =  (X.  ,,E.  n) 
l-l  l-l  l-l 


3.  Set  i  =  i+1  . 

4.  If  i  >  N  ,  then  step;  otherwise  go  to  Step  2. 


From  a  practical  point  of  view  this  algorithm  has  little  to  offer 
over  Method  A,  and  is  obviously  inferior  with  respect  to  the  amount  of 
work  that  is  required;  N(N+l)/2  vertices  must  be  tested.  A  practical 
modification  that  drastically  reduces  the  amount  of  work  required,  and 
actually  improves  the  results  obtained  as  well,  is  to  restrict  the 
candidates  considered  in  Step  2  to  those  having  at  least  one  numbered  neighbor. 
Nevertheless,  experience  has  shown  that  the  Cuthill -McKee  algorithm  seldom 
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produces  a  larger  bandwidth  than  the  minimum  degree  algorithm,  and  even 
with  the  above  modifications  the  latter  requires  substantially  more  work 
than  the  former. 

We  now  turn  to  iterative  methods  for  reducing  the  bandwidth  of  a 
matrix  [Rk,  T3 ] •  Here  it  is  more  convenient  to  speak  in  matrix,  rather 
than  graph-theoretic,  terms.  The  differences  among  these  iterative  schemes 
are  largely  matters  of  programming  techniques  rather  than  fundamental  ideas. 
The  general  idea  follows:  Assume  we  are  given  an  initial  ordering  yielding 
a  bandwidth  of  m  .  Non-zero  elements  satisfying  )i-j|  =  m  will  be 
referred  to  as  edge  elements.  Since  we  are  assuming  that  the  matrix  is 
zero/non-zero  symmetric,  we  will  preserve  the  symmetry  by  interchanging 
corresponding  columns  whenever  rows  are  interchanged. 

1.  Set  max  =  m  . 

2.  Try  to  interchange  rows  containing  edge  elements  with  rows  not 
containing  edge  elements  so  as  to  reduce  the  bandwidth,  and 
simultaneously  interchange  columns. 

3.  Re-compute  m  .  If  m  <  max  ,  then  set  max  =  m  and  go  to 
Step  2. 

b.  If  max  is  greater  than  or  equal  to  its  value  when  Step  k  was 
last  executed,  then  stop.  Otherwise  compute  a  vector  v  of  N 
values  as  follows: 


N 


i  -  L  UKj  h  °! 


3=1 


D-K-j  /  0} 


Order  the  equations  in  increasing  order  of  v  ,  and  order  the 
columns  correspondingly.  The  first  time  this  is  done  the  bandwidth 
may  increase;  after  the  first  step  repeat  as  long  as  the  bandwidth 
decreases.  Re-compute  m  ,  set  max  =  m  ,  and  go  to  Step  2. 
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Step  t  has  the  effect  of  reordering  the  rows  so  that  as  nearly  as 
possible  each  row  has  xhe  same  number  of  non-zero  elements  on  each  side  of 
the  diagonal  element.  It  could  be  called  the  balancing  si;age.  For  matrices 
that  have  ar:  inherent  band  structure  (as  ours  have).  Step  4  does  not  have 
much  affect,  but  for  randomly  sparse  matrices  Step  U  can  improve  the  performance 
of  the  reducer  remarkably. 


1}-.  Node  Ordering  to  Reduce  |pr(A)j 


In  the  light  of  Prop.  1.1  and  Prop.  2.1,  it  should  be  clear  that  if 
A  is  symmetric  and  positive  definite,  a  potentially  profitable  strategy 
for  ordering  is  to  look  for  orderings  which  reduce  9^  or  |Pr(A) | 

(=  lPr(L)|)  • 

The  term  "near  optimal"  as  it  appears  in  the  literature  [R3,T3] 
usually  means  near-optimal  with  respect  to  fill-in.  Under  our  assumption 
that  Pr(L)  «  N*!  ,  a  near  optimal  ordering  should  "nearly"  minimize 

£a 

jPr(A)  |  .  [Since  9p  is  a  more  difficult  function  to  work  with,  we 

have  not  tried  to  look  for  orderings  to  reduce  it.  Tacitly,  we  have 

assumed  that  an  a  yielding  a  small  |Pr(A)  \  will  also  yield  an  acceptable  Gp  .] 

As  with  bandwidth  ordering  algorithms,  there  are  direct  and  iterative 

schemes  for  near-optimal  ordering.  In  order  to  explain  the  first  (direct) 

method  we  define  the  deficiency  D(x. )  [R3  ]  of  a  vertex  x.  in  a 

1  1 

graph  G  by 

(lv.1)  Dfaj)  =  |{(x..,xk)|x..e?Kxi)  A  XfceTKxj)  A  x^x^}!  . 

Recall  the  construction  of  elimination  graphs.  It  is  easy  to  verify  that  if 
D(x^.)  =  0  ,  G.  =  (X.,E.)  is  obtained  from  G.  ,  =  (X..  ,,E.  ,)  by  deletion 
of  x^  and  its  incident  edges;  no  edges  are  added.  This  provides  the 
motivation  for  the 

A.  Minimum  Deficiency  Algorithm  [R3,T3]:  Let  GQ  =  (X,E)  .  Then 

1.  Set  i  =  1  . 

2.  In  the  elimination  graph  G^  ^  ,  choose  x^  to  be  any 
vertex  such  that 

|D(x  )|  =  min  |D(y)| 
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where 


.  G.  ,  =  (X.  _,E.  .)  . 

1-1  1-1  1-1 

3-  Set  i  =  i+1  . 

4.  If  i  >  N  ,  stop;  otherwise  go  to  Step  2. 


In  this  direct  algorithm  the  next  node  to  he  numbered  is  the  one  that 
will  introduce  the  fewest  non-zero  elements  when  it  is  eliminated.  It  is 
obviously  fairly  expensive  to  find  this  node,  since  a  deficiency  test  of  a 
node  y  involves  |??(y)  |  •  |??(y)+l)  |  /  2  edge  tests.  Since  the  graph 
usually  must  be  stored  as  a  bit  matrix,  and  few  machines  are  bit -addressable, 
these  tests  may  involve  considerable  overhead.  As  with  the  minimum  degree 
algorithm  (Section  4.3)  we  have  found  that  restricting  the  candidates  in 
Step  2  to  those  nodes  that  have  at  least  one  numbered  neighbor  does  not 
hurt  the  ordering  produced  by  the  minimum  deficiency  method,  and  this 
restriction  drastically  reduces  the  amount  of  computation  involved. 

The  following  iterative  scheme  has  been  found  to  significantly  reduce 
jPr(A) j  .  Again  we  will  revert  to  matrix  notation.  The  vector  f^  is  as 
defined  in  Section  4.1. 


B.  Profile  Reduction  Algorithm. 

N 


Compute  Q*  = 


2.  Let  the  vector  v  be  defined  by 


v  =  jr  ti|f*  -  i) 
j=i 


For  each  row  i  having  v^  >  0  ,  examine  those  rows 
j  =  i+1, i+2, . .  .,i+k  for  some  (small)  k  >0  ,  and  determine 

s.  .  of  storage  that  can  be  saved  by 
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the  number  of  words 


interchanging  rows  (and  corresponding  columns)  i  and  j  . 

If  the  maximum  s„  is  positive,  interchange  rows  i  and  j 

(and  corresponding  columns),  adjusting  the  vector  f^  accordingly. 
N 

h.  Compute  Q  =  £  (i  -  f ^)  .  If  Q,  <  Q,  ,  then  set  Q,  =  Q,  and 
go  to  Step  2;  otherwise  stop. 


The  actual  search  for  the  best  interchange  (Step  3)  is  by  far  the  most 

expensive  part  of  the  algorithm,  and  in  a  practical  situation  only  those 

rows  with  v^  greater  than  some  threshold  should  be  tested  since  the 

maximum  possible  gain  in  storage  resulting  from  interchanging  row  i  and 

row  j  is  s. .  <  v.  x  (j-l)  •  A  reasonable  threshold  seems  to  be  3  or  4. 

i  j  i 

Good  results  have  been  obtained  with  the  parameter  k  mentioned  in  Step  3 
set  to  5* 
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5 •  Sana  Experiments  with  Ordering  Algorithms 


The  coefficient  matrix  A  obtained  from  the  finite  element  formulation 
of  a  probj.an  tends  to  have  considerably  less  uniformity  in  structure  than 
the  matrix  arising  from  a  finite  difference  method  applied  to  the  same 
problem.  First,  the  node  points  of  the  finite  element  mesh  may  not  all 
play  the  same  role,  and  as  a  result  have  different  connectivities.  Whether 
a  parameter  is  associated  with  a  vertex,  side  or  interior  node  and  whether 
there  is  more  than  one  parameter  associated  with  the  node  will  greatly 
affect  the  number  of  non-zero  elements  in  its  equation.  Second,  the 
finite  element  mesh  will  very  likely  be  graded,  which  also  causes  disorder 
in  the  structure  of  A  . 

Our  aims  in  this  section  are 

(a)  to  report  on  the  performance  of  several  ordering  algorithms  and 
demonstrate  the  savings  attainable  by  using  profile  instead  of  band 
methods  for  storage  and  computation; 

(b)  to  report  on  an  intriguing  and  agreeable  property  of  the  reverse 
Cuthill-McKee  ordering  (our  terminology)  which  we  have  discovered. 

That  is,  if  the  Cuthill-McKee  algorithm  numbers  the  nodes  1,2, ...,N  , 
th  ~  the  reverse  Cubhill-McKee  (RCM)  ordering  would  be  N,N-1, ...,1  ; 

(c)  to  present  some  experimental  evidence  supporting  our  implicit  assumption 
that  the  profile  of  L  ’'3  usually  quite  dense;  i.e., 

|Pr(L)  |  «  . 

We  will  make  use  of  the  following  labels  for  the  different  algorithms 
and  quantities  in  this  section.  Some  of  them  are  repeated  in  other  sections. 


1 1 


i  > 
!  ' 


CM 

RCM 

MDG 

MDF 

BE 

PR 

BC 

PC 


1/ 

*B 


Pr(A) 
M* r(A)) 


Cuthill-McKee 
Reverse  Cuthill-McKee 
Minimum  Degree 
Minimum  Deficiency- 

Bandwidth  Reduction 
Profile  Reduction 

Band  Cholesky 
Profile  Cholesky 


Initial  Ordering  Algorithms 


1 


t  Improvement  Algorithms 


} 


Decomposition  Algorithms 


rBC  • 


multiplicative  operation  count  for  the  {  }  decomposition  algorithm 

■"  it/ 


--  storage  required  to  store  a  symmetric  or  lower  triangular 
matrix  using  the  band  oriented  method  A  (Section  4.2) 

—  storage  required  to  store  a  symmetric  or  lower  triangular 
matrix  using  the  (profile)  storage  method  3  (Section  4.2) 

--  profile  of  the  matrix  A  „ 

--  density  of  the  profile  of  A  . 


In  order  to  keep  bhe  number  and  size  of  our  tables  at  a  level  where  the 
information  can  be  readily  assimilated,  we  have  eliminated  the  MDG  algorithm 
from  consideration  because  we  found  it  to  be  much  inferior  to  the  CM  algorithm. 
As  we  mentioned  before,  it  is  natural  to  use  a  direct  ordering  algorithm  to 
obtain  an  initial  ordering  for  the  iterative  improvement  schemes  (BR  or  PR) . 

We  have  limited  our  studies  to  the  orderings  provided  by  CM,  CM-PR,  RCM  and 
MDF.  [The  hyphen  should  be  read  as  "followed  by".]  The  application  of  the 
BR  algorithm  to  the  CM  and  RCK  orderings  reduced  m  by  only  one  or  two,  and 
so  the  results  are  not  included.  The  application  of  the  PR  algorithm  to  the  RCM 
and  MDF  ordering  resulted  in  only  a  small  reduction  in  Pr(A)  ,  and  was  also 
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not  included.  We  have  limited  our  studies  to  elements  1-3;  2-6  and  3-10 
(see  Appendix  A),  and  to  the  three  domains  shown  below: 


Half-L  Mesh  Hollow-Square  Mesh  Inverted-T  Mesh 

Figure  5*1 

For  the  experiments,  the  meshes  were  subdivided  by  various  factors  as 
described  in  Section  2.3  in  such  a  way  that  for  a  given  domain  each  element 
yielded  the  same  N  .  The  reported  times  are  in  seconds  for  an  IBM  360/91 
computer.  The  values  of  9  and  V  for  each  algorithm  have  been  scaled  by 
the  values  for  the  OM  ordering.  The  actual  values  for  the  CM  ordering  are 
reported  in  parentheses.  As  before,  we  indicate  the  bandwidth  by  m  . 

We  made  use  of  the  geometry  of  the  domain  to  choose  an  initial  node 
for  the  "initial  ordering"  algorithms.  We  arbitrarily  picked  a  node  from 
one  of  the  two  most  widely  separated  triangles  jn  the  domain.  For  "long, 
straight"  domains  this  will  obviously  work  well,  but  for  U-shaped  domains, 
for  example,  it  could  lead  to  a  bad  choice.  One  should  have  the  capability 
of  forcing  the  algorithms  to  begin  at  a  particular  node  in  cases  where  the 
above  strategy  could  lead  to  an  unfortunate  choice.  From  a  practical  point 
of  view,  designing  and  executing  a  sophisticated  algorithm  in  order  to  search 
for  a  good  starting  node  would  probably  be  more  expensive  than  i+s  ultimate 


value  would  warrant. 


The  results  of  the  experiments  are  contained  in  the  following  three 


tables. 


Matrix  and  Elimination  Statistics  for  Several 
Ordering  Algorithms  for  the  Half-L  Domain 


Element 

- 

CM 

CM-PR 

ROM 

MDF 

1-3 

Time 

.183 

1.5 

.183 

16.124 

m 

20 

20 

20 

83 

1(30053) 

1 

1 

13 

N  =  2bl 

S 

1(37 5^5) 

1 

.96 

•91 

lf\ 

CO 

ir\ 

H 

II 

V 

VB 

1(5061) 

1 

1 

6 

VP 

1(4197) 

1 

•97 

•  95 

MMa)) 

.231 

•  231 

.236 

.243 

MP- r(L)) 

1.000 

1.000 

1.000 

1.000 

Fill-in 

3043 

3043 

2950 

2844 

2-6 

.26 

10.3 

.26 

20.83 

m 

42 

48 

42 

52 

gb 

1(196302) 

1.27 

1 

1.47 

N  =  242. 

•p 

1(123342) 

.36 

•  36 

•  30 

=  2581 

Z 

1(10363) 

1.15 

1 

1.25 

V 

p 

1(7413) 

.61 

.60 

.54 

iKPr(A)) 

•  197 

.328 

•357 

.361 

JKpt(l)) 

•979 

•979 

•990 

.996 

Fill-in 

5608 

2805 

2734 

2484 

3-10 

Time 

•25 

9.9 

•25 

27.24 

m 

63 

81 

63 

84 

9B 

1( 4o6i88) 

1.55 

1 

1.4 

N  =  24.1 

GP 

1(239873) 

.30 

.21 

.19 

^  =  3793 

VB 

2(15424) 

1.28 

1 

1-3 

VP 

2(10172) 

.58 

,49 

.44 

j»(Pp(A)) 

.203 

•  553 

.428 

.467 

XPr(L)) 

•979 

.985 

.990 

.996 

Fill-in 

7707 

3606 

2655 

2277 

J _ 
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Matrix  and  Elimination  Statistics  for  Several  Ordering 
_ Algorithms  for  the  Hollow  Square  Domain _ 


Time 

m 

•  17 

16 

H 

.17 

16 

9.2 

20 

9B 

1(34776) 

1 

1 

1.5 

N  =  252 

9P 

1(26299) 

1 

.99 

.95 

k£  =  1620 

VB 

1(4284) 

1 

1 

1.2 

V 

? 

1(3698) 

1 

•99 

•97 

MM*)) 

.272 

.272 

.273 

.278 

MMi)) 

1.000 

1,000 

1.000 

1.000 

Fill-in 

2510 

2505 

2’£9 

2421 

N  =  252 


=  2628  V. 


P 

MPr(A)) 

fi(Pr(L)) 

Fill-in 


.23 

36 

1(155609) 

1(95520) 

1(9524) 

1(6837) 

.219 

.982 


10.49 

46 


.23  18.3 
36  42 


.60  .62  .59 
.372  .362  .381 
.972  .991  -994 
2528  2493  2316 


1  f  cnQ.r7^Q.\ 
V 

1(232516) 
1( l8l44) 
1(10325) 
.204 


.32  25.I 

71  63 

1  .81 

.22  .19 

1  .89 

.49  .44 

.426  .465 


£(Pr(L))  .988  .988  .994  .996 


Fill-in  7898  3468  2738  2345 


Table  5.2 


Matrix  and  Elimination  Statistics  for  Several 
Ordering  Algorithms  for  the  Inverted  T  Domain 


Element 

CM 

CM-PR 

RCM 

MDF 

1-3 

Time 

.22 

2.52 

.22 

9-32 

m 

20 

20 

20 

35 

@B 

1(63283) 

1 

1 

2.82 

N  =  301 

9P 

1(31496) 

1 

.88 

.74 

ud  =  1009 

VB 

1(6321) 

1 

1 

2.52 

Vp 

1(^258) 

1 

•95 

.88 

^(Pr(A)) 

.280 

.280 

.296 

.321 

XPr(L)) 

1.000 

1.000 

1.000 

1.000 

Fill-in 

2852 

ro 

00 

1886 

2334 

2-6 

— 

Time 

.30 

12.36 

.30 

18.38 

m 

4o 

46 

40 

42 

9b 

1(230017) 

1.3 

1 

1.1 

N  =  301 

S 

1(103348) 

•34 

.34 

.28 

=  2628 
z 

1(12341) 

1.15 

1 

1.05 

V 

p 

1(7422) 

.60 

•59 

.56 

MPr(A)) 

.238 

.4o6 

.415 

.441 

rtPr(L)) 

.981 

.983 

.985 

.994 

Fill-in 

5292 

24l4 

2368 

2130 

3-10 

Time 

•  30 

11.5 

•  30 

22.2 

m 

57 

72 

57 

65 

9b 

1(441244) 

1.52 

1 

1.26 

N  =  301 

s 

1(188422) 

•  30 

•  23 

.19 

/  =  U-525 

7, 

1(17458) 

1.26 

1 

1.14 

V 

VP 

1(9887) 

•  57 

•51 

.47 

XPr(A)) 

.252 

.455 

•  515 

.560 

Mpt(l)) 

.981 

•  972 

.821 

•995 

_ 

Fill-in 

6993 

2737 

1428 

1881 
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The  information  in  the  above  tables  leads  us  to  the  following 
conclusions: 

(1)  We  appear  to  be  fully  just'  'ied  in  assuming  that  L  is  almost  dense. 

[We  have  computed  the  fill-in  for  sane  random  orderings  as  well,  and 

although  ^(Pr(L))  was  smaller  for  some  other  orderings,  we  observed 

II  L. 

that  |Pr(A^)  j  <  jPr(A^) |  =>  N  x  <  ,  where  A^  is  the  matrix  A 

T 

with  some  ordering  ,  and  =  L.L^  .  In  other  words,  reducing  the 
profile  appears  to  reduce  the  fill-in.] 

(2)  The  RCM  algorithm  seems  to  be  easily  the  best  algorithm.  The  ordering 
not  only  supplies  a  near  optimal  bandwidth,  but  also  yields  a  profile  almost 
as  good  as  the  MDF  algorithm,  which  is  prohibitively  expensive.  [There  are 
several  reasons  why  methods  based  on  elimination  graphs  are  expensive  to 
use.  First,  even  if  we  restrict  the  candidates  to  be  ordered  first  to 
those  having  at  least  one  numbered  neighbor,  the  number  of  candidates 
tends  to  be  quite  large,  particularly  for  elements  with  relatively  many 
nodes.  Secondly,  we  not  only  must  test  edges  of  the  graph,  but  we  also 
roust  usually  add  edges  as  new  elimination  graphs  are  formed.  This  addition 
of  edges  requires  computer  time,  and  also  increases  the  degree  of  the  nodes 
which  are  candidates  or  potential  candidates  for  subsequent  ordering.  Since 
the  required  work  for  each  step  of  the  MDF  algorithm  is  proportional  to  the 
sum  of  the  squares  of  the  degrees  of  the  nodes  being  tested,  these  added  edges 
can  dramatically  increase  the  amount  of  work  involved.] 

The  reason  that  the  RCM  ordering  is  superior  to  the  CM  order  (profile- 
wise)  can  be  explained  as  follows.  The  CM  algorithm  tends  to  order  the 
neighbors  of  each  node  consecutively,  and  the  non-zero  elements  of  A  thus 
tend  to  be  arranged  in  sequences  in  successive  rows  (columns)  of  the  lower 
(upper)  triangle  of  A  .  This  is  just  the  reverse  of  what  we  want  for  a 
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small  profile;  hence  the  discovery  of  the  RCM  ordering. 


(3)  It  is  very  beneficial  to  use  profile  methods  rather  than  hand 
methods.  The  following  table,  which  can  be  obtained  from  the  tables 
above,  brings  out  this  point  dramatically. 

NsXJ)omain  1!  T _ ^ 


\  I 

ExemeniN^J 


Half-1 

Hollow 

Square 

Inverted 

T 

•72 

•75 

.44 

.82 

.86 

.64 

.21 

.22 

.14 

.43 

_  I 

.45 

•35 

.13 

.098 

•097 

•  32 

.28 

.28 

Table  5-4:  9p(RCM)  /  ©b(RCM)  and  Vp(RCM)  /  Vb(RCM)  for 
Each  Element -Domain  Combination. 


(4)  Although  we  make  no  claims  about  tne  programming  of  the  ordering 
algorithms  (they  could  be  improved  by  programming  some  of  the  bit-pushing 
in  machine  language),  the  reported  times  are  an  accurate  reflection  of 
relative  numbers  of  edge  tests  (zero/non-zero  tests)  required  by  each 
algorithm.  Hence,  although  the  magnitudes  of  the  times  might  be  Improved 
by  a  more  careful  implementation,  we  would  not  expect  their  relative  size 
to  change  much. 
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6. 


4 

The  Value  of  for  Arbitrary  Elements  and  Triangular  or 

Quadrilateral  Meshes 

Suppose  we  have  an  arbitrary  triangular  mesh  with  triangles, 

V_  boundary  vertices,  and  V_  interior  vertices .  Let  S_  be  the  number 
B  1  B 

of  triangle  sides  lying  on  the  boundary  and  be  the  number  of  sides 
lying  in  the  interior  of  the  mesh.  Let  H  be  the  number  of  holes  in  the 
mesh  (domain) . 

In  order  to  characterize  the  stencil,  let  n^.  ,  rig  and  n^  be  the 
number  of  parameters  associated  respectively  with  vertex  nodes,  the  node(s) 
on  each  side,  and  the  interior  of  eacn  triangle.  For  example,  element  3-10 
(Appendix  A)  would  yield  n^  =  1  ,  ng  =  2  ,  and  n^  =  1  .  As  in  Chapter  3, 
we  let  n  =  3(n^+ng)  +  nI  . 

Our  aim  in  this  section  is  to  obtain  in  terms  of  N  ,  V  ,  S  , 

Zi  A  Jd  B 

nc  ,  ng  and  n^.  .  Our  method  of  proof  is  similar  to  that  in  [El],  where 
the  following  relations  between  mesh  parameters  are  proved. 

(6.1)  N  =  -rr  (S  +  2S  )  =  V  +  2Vt  +  2H  -  2  . 

'  A  3  B  I  B  I 

Consider  the  following  typical  mesh: 


Figure  6.1 
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Our  strategy  is  to  successively  remove  triangles  from  the  mesh  in 
such  a  way  as  to  leave  all  remaining  triangles  with  at  least  one  side 
inside  the  mesh.  (Thus,  triangles  1  or  2  in  Figure  6.1  can  be  removed, 
but  3  cannot.)  As  we  remove  triangles,  we  will  count  the  number  of  non-zero 
elements  removed  from  A  .  We  ignore  those  cases  where  elements  are 
accidentally  zero  because  of  the  regular  properties  of  the  mesh  and/or 
the  coefficients  of  the  differential  operator.  We  neel  the  following 

Lemma  6.1 

Let  a  triangle  of  type  1  (having  two  external  sides)  be  removed  from 
the  mesh.  Then  is  reduced  by 

Zt 

(6.2)  a1  =  n2  -  (ng  +  2nv)2  . 


Proof: 

The  total  number  of  elements  in  A  due  to  the  interaction  of 

2 

parameters  associated  with  a  triangle  is  n  .  However,  not  all  the 
connections  are  removed  by  the  deletion  of  triangle  lj  those  corresponding 
to  parameters  lying  on  the  remaining  side  of  triangle  1  (including  its  end 
nodes)  are  not  removed,  and  there  are  (n„ +  2n  )  such  non-zero  elements. 

b  V 

This  proves  the  lemma. 


Lemma  6.2 


Let  a  triangle  of  type  2  be  removed  from  the  mesh.  Then  is 

Lx 


reduced  by 


(6.3)  a2  =  2n(n].+  ns)  -  (n^n^2  +  2(ng+nv)2  . 


1 


I 


a 

I 


I 
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Proof: 


As  in  Lemma  6.1,  we  first  note  that  the  total  contribution  to  il^ 

2 

from  the  connections  of  parameters  in  triangle  2  is  n  .  However,  two 
of  the  triangle  sides  and  their  incident  vertex  remain  in  the  mesh,  so  the 
connections  of  their  parameters  must  not  be  counted  unless  they  correspond 
to  different  remaining  sides.  The  truth  of  (6.3)  can  be  demonstrated  by 
assuming  the  equations  in  question  are  all  grouped  last  in  A  and  examining 
Figure  6.2.  The  submatrices  marked  with  an  asterisk  are  the  parts  removed 
from  A  (in  the  diagram  below) 
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Figure  6.2 
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Summing  the  elements  in  the  marked  submatrices  yields  (6.5)* 

Now  suppose  the  mesh  has  a  hole  in  it.  Eventually  we  will  reach  a 
situation  where  the  hole  is  bounded  at  one  plae  by  a  single  side  such  as 
depicted  below. 


Figure  6.3 

Such  a  side  will  be  referred  to  as  a  connecting  side.  We  present  the 
following 

lemma  6.3 

Let  n  ,  n_  and  n  be  as  above.  Then  the  removal  of  a  connecting 
V  b  1 

side  from  the  mesh  reduces  by 

2  P 

(6.h)  cfj  =  ng  +  *tognv  +  2n^ 

The  proof  is  similar  to  that  employed  in  Lemma  6.2  and  we  omit  it. 

We  can  now  prove  the  following 


Theorem  6.b 


Let  V  ,  S  and  H  be  the  number  of  vertices,  sides  and  holes 
respectively  in  a  two  dimensional  triangular  mesh.  Let  n  ,  cr^  , 
and  Oj  defined  as  above.  Then  is  given  by 

(6.5)  ■  n2  +  (V-3)^  +  (S+3 -H-2V)<t2  +  Hcr^  . 


Proof: 

Suppose  we  reduce  our  mesh  to  a  single  triangle  by  successively 

removing  triangles  of  type  1  and  type  2,  (and  connecting  sides  if  any), 

2 

leaving  A  with  n  non-zero  elements.  In  order  to  reduce  the  mesh 
to  one  triangle  we  must  remove  V-3  triangles  of  type  1,  since  removal 
of  a  type  2  triangle  or  a  connecting  side  removes  no  vertex.  Thus  the 
removal  of  type  1  triangles  results  in  the  reduction  of  by  ^(V-3)  . 
Now  each  hole  will  result  in  the  occurrence  of  one  connecting  side  being 
removed  during  the  reduction  of  the  mesh,  and  this  will  reduce  by  cr^H  . 

These  two  forms  of  demolition  account  for  the  removal  of  2(V-3)+H  triangle 
sides,  and  three  sides  remain  in  our  final  triangle.  Hence,  we  must  have 
removed  S-3-2(V-3)-H  triangles  of  type  1,  accounting  for  a2(S+3-2V-H) 
non-zero  elements.  Summing  the  above  expressions  yields  (6.5). 

Using  (6.1),  can  be  expressed  in  terms  of  other  (perhaps  more 

commonly  available  or  easily  obtainable)  mesh  parameters. 

A  similar  analysis  can  be  carried  out  for  quadrilateral  elements.  If 
the  mesh  has  no  holes,  there  are  three  cases: 
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Case  1 


v 

Case  2 


L _ i 


Case  3 
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If  N-j  is  the  number  of  quadrilaterals  in  the  mesh,  we  can  obtain  the 
following  expressions  using  the  same  tecnniques  as  we  did  for  the 
triangular  mesh: 


(6.6) 


where 


=  a171  +  Cg7g  +  +  n2  , 


°i  =  r‘2  "  (2nv+  ns^2 


°p  =  n(nI  +  nv+2ng)  -  (nI  +  nv+2ng)2  +2(nTr+nc)2  , 


V  S; 


a3  =  n(ni  +  ns>  -  (Vns>  +  6(ns  +  V  > 


n  =  U(nv+ng)  +  n^.  , 


and  7^  ,  y 2  and  7^  are  non-negative  integers  satisfying 

2yl  +  7  2  = 

(6.7)  5 71  +  272  +  7^  =  S-4 


h  +  +  >5  ’  'b'1 
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The  numbers  7^  ,  y2  and  7^  are,  respectively,  the  number  of 
instances  of  case  1,  case  2  and  case  3  encountered  during  the  reduction 
of  the  mesh.  The  coefficient  matrix  of  (6.7)  is  singular,  reflecting 
the  fact  that  there  are  alternate  ways  to  demolish  the  mesh,  resulting  in 
different  values  of  7^  ,  and  7^  .  We  can  resolve  the  problem  as 
follows.  First  we  observe  that  =  c^-2o^  .  Using  (6.7)  in  (6.6),  we 
have 

(6.8)  n£*  =  a171  +  a2(V  -  h  -  2y1)  +  2N  -S+2+7^  +  n2 

u 

=  y1(a1  -  2os  +  a5)  +  c2(V-k)  +  cr5(2N  -S+2)  +  n2 

=  a  (V-h)  +  a,(2N  -S  +  2)  +  n2. 

<=  7  rj 

If  our  mesh  has  H  holes  in  it,  and  we  rename  the  a ^  of  Lemma  6.3 
as  0^  ,  equation  (6.8)  becomes 

(6.9)  =  a2(V-U)  +  a5(2N_^-S+2  +  H)  +  cr^H  +  n2. 

This  information  is  important  because  it  allows  us  to  allocate  the 
exact  amount  of  storage  for  the  non-zero  elements  of  A  as  soon  as  we 
know  the  mesh  and  the  characterization  of  the  polynomial  on  each  element. 
It  is  also  useful  in  checking  that  our  mesh  is  consistent  and  our  program 
is  working  correctly. 

The  expressions  we  have  derived  allow  us  to  obtain  an  estimate  for 
the  density  M(A)  =  /  N2  for  finite  element  coefficient  matrices.  Using 

(6.4)  and  (2.5.6),  along  with  (2.5.2)  and  (2.5.3),  we  have 
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(6,10) 


-0(A)  = 


+  °p 

^  2  'WA 


[(nI  +  I  “V  +  2  nS)MAl2 


1  , 


r(W“i) 


na 


!  i 


where 


! 

I! 


u 


y 


(6.11) 


= 


°1  + 


2(nl  +  2  \  +  2  nS^ 


Ihe  average  number  of  non-zero  elements  per  row  of  the  coefficient  matrix 

1  1  5 

is  obviously  given  by  |  ^  |  ns). 

Some  typical  values  of  T  and  average  number  of  non-zero  elements  per  row 
are  tabulated  below. 


Element 

Average  number  of 
non-zero  elements 
per  row 

1-3 

14.00 

7.00 

2-6 

5-75 

11.50 

3-^ 

8.08 

20.20 

3-10 

3-78 

17.00 

4-6 

6.59 

29.67 

4-15 

2.94 

23.5 

5-6 

10.15 

45.67 

5-21 

2.48 

31.00 

Table  6.1 


u 

f  > 

Li 
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7 •  Analysis  of  Storage  and  Computational  Requirements  Cor  a 


Model  Problem 

In  this  section  we  obtain  estimates  of  9p  ,  9p  ,  Yp  and  Vp  for 

a  particular  mesh,  in  older  to  demonstrate  the  savings  attainable  by  using 

profile  methods  rather  than  band  methods.  The  mesh  we  consider  is  obtained 

2 

by  subdividing  a  unit  square  into  p  small  squares  of  side  l/p  ,  and  then 
subdividing  each  small  square  into  two  right  triangles.  An  example  with 
p  =  6  is  given  below. 


Figure  7*1*  Six  by  Six  Regular  Right  Triangular  Mesh 


As  in  Section  4.6,  let  n^  ,  ng  and  n^  denote  the  number  of 
parameters  associated  respectively  with  vertex  nodes,  the  node(s)  on  each 
side,  and  the  interior  of  each  triangle.  We  number  the  nodes  diagonal  by 
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diagonal,  beginning  at  the  lower  left  hand  corner,  and  considering  nodes 
lying  between  consecutive  diagonals  as  a  row.  For  example,  stencil  3-10 
(Appendix  A)  would  yield  the  numbering  shown  below: 


Figure  7.2 


As  usual,  we  denote  our  symmetric  positive  definite  coefficient  matrix 

T 

by  A  ,  with  Cholesky  factorization  LL  .  Making  use  of  (6.1),  and 

recalling  the  meaning  of  N.  ,  V_  ,  S_,  and  H  ,  we  can  write 

A  B  d 


where  N  is  the  dimension  of  A  .  For  a  p  by  p  mesh  such  as 

p 

Figure  1 .1,  N  =  2p  and  S_.  =  V_  =  4p  ,  yielding 
A  ±5  £> 

(7-2)  N  =  (nv  +  3ns  +  2n];)p2  +  0(p)  . 

For  this  ordering,  the  bandwidth  of  A  is  given  approximately  by 

(7-3)  m  =  (p^Jriy  +  3png  +  (2p-l)ax  =  (ry  +  3ng  +  2pnj)p 

=  PBP  • 

op 

.  PB  k  .23 

Thus,  9b  =  p  and  VB  =  . 

We  now  wish  to  obtain  estimates  for  9p  and  Vp  .  To  simplify  the 
algebra,  we  assume  n^  =  1  ,  ng  =  n^  =  0  (piecewise  linear  polynomials) . 
We  can  then  prove  the  following 

Theorem  7.1 

For  a  p  x  p  regular  rectangular  grid,  the  coefficient  matrix  A 
obtained  using  piecewise  linear  polynomials  satisfies 

(7  «}0  |Pr(A)  |  =  ^  +  |p2  +  |^P  • 
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Proof: 


Considering  the  first  node  point  as  diagonal  1,  and  recalling  the 
& 

definition  of  8^  (Section  4.1),  we  see  that 


61  =  0 
Sg  =  1 

A  A 

6?  =  2  ,  =  2 


"3 

.A 

>5 


5^  =  3  ,  5^  =  3,  6^  =  3 


(7-5)  5^  =  r  ,  ^-^-+2  <  i  <  ^|^+1  ,  l<r<p  . 


A 

Now  for  the  main  diagonal,  6^  =  p+1  ,  and  for  the  diagonals  above  the 


main  diagonal  we  can  show  that 


(7.6)  &£_.  =  r  ,  ■^r"1Kr'2).  <  i  <  -  l  ,  3  <  r  < 


p+1 


N 


Using  the  formula  |Pr(A)  j  =  (8^+ l)  along  with  (7.5)  and  (7.6) ; 


we 


have 


|Pr(A) 


■ 


p 


i(i+2)  +  p(pf2)  +  1+  V  i(i+l) 
£=1  £l 


=  £  2i  +  3i  +  p(p*-2)  +  1 


1=1 


=  +p2  +  2p  +  l  =  %^  +  |p2  +  ^+  l. 


Thus,  using  the  profile  storage  scheme  rather  than  the  band  storage 
scheme,  we  can  save  about  one  third  of  the  storage  for  A  or  L  .  It  is 
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straightforward  (but  tedious)  to  show  that  for  a  general  stencil; 


(7.7)  |Pr(A)j  <  f  PpP5  , 

with  pp  0B  . 

Recall  that  in  Section  4.1  we  showed  that 

K  8^  +5) 

(7.8)  9  =  I  -H —  • 

i=l 

Again  assuming  n  =  1  and  n„  =  n  =  0  ,  we  can  prove  the 
V  b  X  “ 

following 


Theorem  7*2 

Let  the  FC  algorithm  be  applied  to  A  .  Then  the  number  of 
multiplicative  operations  required  to  compute  L  is  given  by 


(7.9) 


1  4 
£  p 


3p^  + 


4l  2 

Tp 


19 

2 


Proof: 

Using  (7.8)  along  with  (7.5)  and  (7*6);  we  have 
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£  i(i+l)(i+lf)  , ,  p(jp+2)  (p+5)  +  J,  1&A&51 

P  JL  2  2  f~L  2 

1=1  1=1 


*-  (i3t6i2+7i)  *  ijutim. 


k 


1  b  .  ,  3  .  4l  2  ,19 
Tf  P  +3p  +  -jj-  P  +  -fv 


Again,  with  some  tedious  algebra,  we  can  show  that 


(7.10) 


13k 


It  is,  therefore,  possible  to  halve  the  computation  required  to  compute 
L  by  using  profile  instead  of  band  methods.  Note  that  we  did  not  prejudice 
our  comparison  by  ordering  diagonally,  since  the  bandwidth  would  be  the 
same  if  we  numbered  our  nodes  in  the  usual  row  by  row  fashion. 


In  this  way,  the  dimension  of  (see  Section  3*4)  can  he  reduced  by 

.  We  can  carry  out  a  somewhat  superficial  analysis  of  the  model 
problem  discussed  in  Section  4.f  to  show'  the  ravings  possible  by  using  this 
technique.  To  simplify  the  analysis  we  will  consider  the  use  of  the  band 
Cholesky  algorithm,  and  consider  element  3-4 .  It  is  easy  to  show  that 
using  ^he  ordering  of  Section  4.7,  the  band  width  m  is  about  5p  and 
the  number  of  equations  N  is  about  Sp"  ,  yielding 

(8.3)  SB  =  125P4  • 
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Now  consider  the  corresponding  quantities  if  we  apply  static  condensation 
2 

and  eliminate  2p  variables  before  assembly.  The  bandwidth  m  is  now 

2 

only  about  3p  ,  and  N  =  3p  •  Thus 
(8.4)  eB  4  27P4  • 

It  is  fairly  easy  to  show  that  the  number  of  multiplicative  operations 
required'  to  eliminate  the  variables  is 

(8-5)  9gc  =  32p2  , 

which  means  the  technique  pays  (in  terms  of  multiplicative  operations) 
for  this  particular  element,  problem,  and  solution  method  for  all  p  .  Of 
course,  its  use  might  be  justified  for  storage  reasons  alone,  even  if  it 
did  not  reduce  the  computation. 

In  general,  9gc  is  given  by 

n5 

(8.6)  [  -^  +  3nI(nI+ l)(ng  +  nv)  +  9^0^.+  ng)2]  N&  . 

Another  technique  sometimes  used  in  connection  with  solving  finite 
element  equations  is  the  so-called  frontal- solution  method  [12,  K2].  The 
basic  strategy  is  to  combine  the  assembly  and  decomposition  of  A  by 
alternating  between  the  accumulation  of  coefficients  of  the  equations  (most 
of  the  coefficients  depend  on  more  than  one  element)  and  the  elimination. 

A  square  submatrix  of  A  (in  some  stage  of  reduction)  is  the  only  main 
storage  required.  The  matrix  corresponds  to  "active"  variables;  that  is, 
variables  which  have  not  been  eliminated  and  for  which  there  are  non-zero 
coefficients  in  the  equations  so  far  encountered.  The  subset  of  active 
variables  continuously  changes  as  new  elements  are  processed.  The  main 
point  that  is  usually  made  in  favor  of  these  schemes  is  that  variables  are 
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eliminated  as  soon  as  possible,  rather  than  in  a  predetermined  order. 
However,  this  flexibility  is  obtained  at  a  rather  high  cost  in  programming 
complexity,  and  the  question  of  ordering  has  really  only  been  moved  a 
level  higher.  The  problem  of  optimal  equation  ordering  has  been  replaced 
by  the  problem  of  optimal  order  of  element  assembly.  Our  general  impression 
is  that  these  methods  will  be  most  valuable  when  main  storage  is  at  a 
high  premium. 


CHAPTER  5 


FINITE  ELEMENT  SOLUTIONS  TO  SOI®  SELECTED  PROBLEMS 

1.  Introduction 

In  this  chapter  we  will  present  finite-element  solutions  to  some 
much-studied  problems  for  which  numerical  solutions  have  been  presented 
in  the  literature.  Our  purpose  is  not  necessarily  to  present  more 
accurate  solutions  than  have  been  presented  before,  but  rather  to 
demonstrate  that  the  finite  element  method  enables  us  to  obtain 
comparatively  good  results  efficiently  and  without  resorting  to  special 
methods.  We  will  provide  evidence  suggesting  that  the  finite- element 
method  is  not  only  desirable  because  of  its  flexibility  regarding 
irregular  domains  but  is  competitive  or  superior  to  common  alternate 
methods  with  respect  to  efficiency. 

The  term  efficiency  is  somewhat  difficult  to  define  quantitatively 
since  storage  requirements,  computer  time,  and  manpower  have  different 
relative  costs  in  different  situations.  Loosely,  efficiency  will  mean 
"number  of  correct  digits  per  dollar". 

We  would  like  to  emphasize  that  the  finite-element  solutions  presented 
in  this  chapter  have  been  produced  by  a  general  program.  No  r  ,e  was  made 
of  any  sp  c ial  characteristics  of  the  problems  other  than  those  an  engineer 
would  reasonably  expect.  For  example,  we  graded  the  net  small  near  the 
re-entrant  corner  of  the  L-shaped  membrane  eigenvalue  problems  (Section  2, 
this  chapter),  but  we  did  not  attempt  to  incorporate  "singular  functions" 
into  the  basis  [F2,F6l. 

Since  we  are  using  a  Ritz  procedure,  our  computed  eigenvalues  for 
the  problems  below  are  upper  bounds  for  the  true  eigenvalues. 


2.  The  L-Shaped  Membrane  Eigenvalue  Problem 

The  L-shaped  membrane  eigenvalue  problem  has  been  studied  by  many 
authors.  For  background  materiel,  see  Forsythe  and  Wasow  [F5]  and  Moler 
[M3L  and  for  various  special  computational  methods,  see  Reid  and  Walsh 
[R2],  Fix  [F2],  Schwartz  [s?L  and  Fox,  Henrici,  and  Moler  [ f6] -  The 
domain  R  consists  of  the  union  of  three  unit  squares,  and  we  wish  to 
find  the  stationary  values  V  (0  <  k^  <  k^  <  k^  <  . . .)  of  the 
functional: 

(2.1)  I[u]  =  J[  [u2  +  u2]dxdy  /  JJ  u2dxdy  , 

R  R 

where  u  =  0  on  the  boundary  S  . 

The  interesting  aspect  of  this  problem  is  provided  by  the  re-entrant 
corner,  which  leads  to  unbounded  derivatives  of  the  fundamental  eigen¬ 
function  in  the  neighborhood  of  the  corner.  Thus,  the  eigenfunction  is 
difficult  to  approximate  by  functions  which  do  not  exhibit  a  similar 
behavior.  The  value  k^  =  9.65972  reported  in  [f6]  is  accurate  to  the 
last  digit,  and  we  will  use  it  for  comparison. 

Our  first  experiments  make  use  of  the  following  triangular  mesh: 
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We  are  obviously  making  use  of  the  symmetry  of  the  ijrst  eigenfunction 
here,  and  have  graded  the  net  appropriately  near  the  corner.  In  the 
tables  below  k  is  the  factor  by  which  the  mesh  of  Figure  5.1  was 
sub-divided.  The  eigenvalues  we  found  using  inverse  iterative  [W2]  with 
an  initial  guess  of  9.6  .  The  computed  is  believed  to  be  correct 
to  the  last  digit.  Set-up  time  includes  the  time  required  to  generate 
the  mesh  and  order  the  nodes  as  well  as  the  time  required  to  generate 
and  assemble  the  equations.  The  missing  times  in  the  table  were  so  small 
that  they  were  meaningless.  All  times  are  in  seconds  on  an  IBM  560/9!- 


* 

Yh 

for  piecewise 

linear 

functions 

k 

Number  of 
Equat ions 

Bandwidth 

Set-up 

Time 

Solution 

Time 

* 

Y\ 

1 

5 

•  3 

•  33 

— 

3.4003 

2 

22 

6 

•57 

•05 

1.0089 

3 

51 

12 

.62 

.10 

.4605 

4 

92 

12 

•93 

•  17 

.2640 

5 

145 

15 

1.33 

.30 

.1718 

6 

210 

18 

2.54 

•55 

.1210 

7 

287 

21 

2.73 

.90 

.0901 

8 

376 

24 

3.53 

1.28 

.0699 

Table  2.1 

The  rate  of  convergence  of  the  .computed  to  ^  as  k  -*  «>  is 
obviously  exceedingly  slow. 

The  systems  of  linear  equations  involved  in  the  inverse  iteration 
routine  we  solved  using  a  band  Gaussian  elimination  routine.  The  code 
appears  in  Appendix  3.  It  is  interesting  to  note  that  the  set-up  time 
dominates  the  solution  time  in  all  cases.  This  is  due  in  part  to  the 
characteristics  of  the  IBM  36o/91>  which  has  a  very  fast  floating-point 
arithmetic  unit  and  a  look-ahead  instruction  stack.  Both  features  tend 
to  make  "number  crunching"  tasks,  such  as  Gaussian  elimination,  proceed 
rapidly  and  efficiently.  The  set-up  pucedure,  on  the  other  hand,  requires 
considerable  bookkeeping  and  branching.  Programs  of  this  type  do  not 
make  effective  use  of  the  powerful  machine  features  mentioned  above. 

Another  reason  for  the  relatively  large  set-up  time  is  that  we  are  using 
low  degree  polynomials.  The  number  of  triangles  to  be  processed  (and 
the  associated  overhead)  is  larger  with  respect  to  N  than  it  would  be 
for  quadratics,  for  example.  Note,  however,  that  the  ratio 
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(Set-up  time) /(Solution  time)  is  steadily  (if  slowly)  decreasing. 

Our  second  experiment  again  makes  use  of  the  mesh  of  Figure  5.1, 
but  we  now  use  polynomials  of  higher  degree  to  demonstrate  how  efficient 
they  can  be.  Table  2.2  contains  results  for  polynomials  of  degrees  1 
through  6  ;  in  all  cases  the  original  mesh  was  used.  Our  inverse 
iteration  routine  for  these  experiments  used  a  symmetric  indefinite 
equation  solver  using  the  pivoting  algorithm  of  J.  R.  Bunch  [B13!  •  The 
code  for  this  computation  appears  in  Appendix  B . 

*  ^ 
for  Piecewise  Polynomials  of  Degrees  1  to  6 


Degree 

Number  of 
Equations 

Bandwidth 

Set-up 

Time 

Solution 

Time 

* 

1 

5 

3 

.33 

-- 

5.4003 

2 

22 

11 

.43 

.1 

.3720 

3 

51 

24 

•70 

•25 

.0160 

4 

92 

42 

•87 

•95 

.0063 

5 

145 

65 

1.7 

5.02 

.0034 

6 

210 

101 

2.68 

6.02 

.0021 

Table  2. .2 

It  is  obvious  that  for  this  problem  the  use  of  polynomials  of 

degree  >  1  are  considerably  more  effective  than  linear  ones. 

* 

It  is  interesting  to  note  that  the  obtained  using  quintic 

* 

polynomials  (Table  5*2)  yielding  145  equations  is  comparable  to  the 
obtained  by  Moler  [M3]  using  finite  difference  methods  on  a  uniform  mesh 
with  h  =  l/lOO  (yielding  15,000  equations).  Our  storage  requirements 


were  virtually  the  same;  we  required  15385  words  (including  the  storage 
of  A  and  B  of  the  generalized  eigenvalue  problem  Ax  =  kBx  ) . 

Moler's  Fortran  program,  written  specifically  for  this  problem,  took 
about  12  minutes  to  execute  on  an  IBM  7090.  Thus  there  is  a  factor  of 
roughly  150  in  execution  times.  The  ratio  of  speeds  of  the  arithmetic 
units  is  about  100,  while  the  effective  memory  speed  ratio  is  about  10. 

The  ratio  of  times  for  other  operations  lie  somewhere  between  these  two 
extremes.  We  feel  we  can  safely  say  that  the  finite  element  method  is 
at  least  fully  competitive  with  finite  difference  methods  for  this 
problem . 

It  was,  of  course,  not  necessary  to  use  inverse  iteration.  We  could 
have  used  a  method  due  to  Peters  and  Wilkinson  [PI]  which  essentially  finds  the 
zeros  of  det(A-kB)  .  Although  the  running  times  would  be  considerably 
larger  than  for  inverse  iteration,  the  required  storage  for  our  quintic 
problem  would  be  a  total  of  10,556  words  (storage  for  A  and  B  and  en 
additional  ((m+1)  x  (2ir<+l))  words  for  the  determinant  evaluation).  Both 
this  method  and  inverse  iteration  can  be  used  to  find  subdominant 
eigenvalues,  whereas  the  method  used  in  Moler  [M3]  is  applicable  only 
for  an  end  eigenvalue.  To  find  subdominant  eigenvalues  using  his  technique 
would  require  some  form  of  deflation  to  render  the  dominant  eigenvalues 
equal  to  zero.  To  avoid  making  the  coefficient  matrix  dense,  the 
deflation  would  have  to  be  done  implicitly  which  implies  that  the  eigen¬ 
vectors  corresponding  to  dominant  eigenvalues  would  have  to  be  available. 

We  feel  that  the  ability  of  the  high  order  finite  element  methods  to 
obtain  respectable  lesults  using  only  moderate  numbers  of  parameters  is 
particularly  important  for  eigenvalue  problems  because  it  enables  us  to 


138 


apply  well  known,  dependable  methods  for  finding  the  eigenvalues  of  the 
discrete  problem. 

We  again  emphasize  that  we  are  not  implying  that  finite  element 
methods  are  the  best  ones  to  use  for  solving  this  particular  problem. 
Indeed,  the  method  proposed  by  Fox,  Henrici  and  Moler  [F6 ]  is  probably 
the  best  known  method  for  finding  the  eigenvalues  of  the  L-shaped 
membrane.  However,  the  use  of  such  techniques  requires  information 
which  may  only  be  known  to  an  expert  in  the  field,  and  the  utilization 
of  them  in  a  general  code  is  complicated. 
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3.  Eigenvalues  of  Rhombical  Domains 


Bounds  for  the  eigenvalues  of  rhombical  domains  have  been  obtained 
by  Moler  [m4],  Birkhoff  and  Fix  [ B7],  and  Stadter  [ s4] .  Moler  obtains 
his  bounds  using  a  method  of  particular  solutions,  and  Stadter  obtains 
bounds  using  the  method  of  intermediate  problems  [S^].  In  this  section 
we  will  show  that  with  finite  element  formulations  having  relatively  few 
parameters  we  can  get  close  to  or  within  the  bounds  produced  by  the  methods 
described  in  the  above  references. 

The  problem  we  considered  is  the  equation  (2.1)  of  Section  2  with 
a  rhombical  domain  of  side  «  and  skew  angle  9  as  indicated  below: 


y 


Figure  5«2 


1*10 


and  the  results 


Our  first  experiment  takes  no  account  of  symmetry, 
are  compared  with  some  of  the  bounds  presented  by  Moler  [M4].  These 
results  are  summarized  in  the  table  below.  As  before,  k  indicates  the 
factor  by  which  the  input  mesh  (indicated  by  the  dashed  line  in  Figure  5*2) 
has  been  subdivided.  N  is  the  number  of  equations,  m  is  the  bandwidth 
and  d  is  the  degree  of  the  piecewise  polynomials. 

The  method  employed  by  Moler  utilizes  particular  solutions  to  the 
Laplacisn  operator,  and  the  20  particular  solutions  used  were  carefully 
chosen  to  agree  with  symmetries  of  the  eigenfunction  being  approximated. 
Each  bound  calculation  required  about  20  seconds  on  an  IBM  360/67.  Our 
calculations  were  done  on  an  IBM  360/91.  Our  set-up  times  (for  each 
example)  and  solution  times  (for  each  eigenvalue)  have  been  included  in 
Table  3.1  for  comparison  purposes.  The  360/67  and  360/91  have  radically 
different  design  features  and  a  comparison  between  the  two  machines  is 
difficult.  The  largest  ratio  of  execution  times  this  author  has 
encountered  between  identical  programs  run  on  the  two  machines  is  15,  and 
that  was  for  a  very  special  program.  Usually  the  ratio  is  from  three  to 
six  and  is  almost  always  less  than  ten. 


Case  1:  Rhombical Membrane  Eigenvalues:  9  =  30°  . 


* 

* 

X2 

$ 

Set-up  and 
Solution  times 

Moier's 

Bounds 

2.51921 

2.52c06 

'  5.33333 
5.33334 

1 

8.47510 

8.50997 

w  20  sec  per 
eigenvalue  on  an 
IBM  360/67 

k-2, 

N  =  49, 
m  =22, 

d  =  4 

2.52302 

5-33341 

7.26942 

8.5047 

.41 

.13 

wBM 

1 

2.52284 

5-33339 

7.26653 

8.49424 

.69 

•95 

k  =  4, 

N  =  225, 

m  =  65, 
d  =  4 

2.52279 

5.33334 

7.26611 

8.49374 

1.36 

3.7 

k-2, 

N  =  8l 
m  =  35; 
d  =  5 

2.52284 

i 

_ 

5-33340 

i 

_ 

7.26651 

8.49420 

1.2 

.4 

Table  3-1 


Case  2:  Rhombical  Membrane  Eigenvalues:  6 


i  ; 

* 

X1 

* 

X2 

* 

* 

Moler's 

Bounds 

2.01218 

2.01248 

4-90375 

is.  90^03 

5.15659 

: 

|  5.15750 

7.99206 

7.99594 

N  =  49, 
ra  =  22, 

4  =  4, 
k  =  2 

2.01232 

4.90567 

5.16407 

8.00979 

N  =  121, 

.n  =  43, 
d  =  4, 
k  =  3 

2.01226 

4.90U05 

5.15755 

7.99516 

N  =  8l, 

m  =  35> 

4  =  5, 

k  =  2 

2.01226 

4. 90^08 

5.15730 

7.99851 

N  =  196, 

m  ^  69, 
d  =  5, 
k  =  3 

2.01225 

4.90389 

5.15705 

7.99508 

Our  first  observation  is  that  again  the  higher  degree  polynomials 
appear  to  be  more  efficient .  For  example,  in  Case  1,  using  quintics 
with  N  =  8l  and  m  =  35  yields  results  as  good  as  the  quartic 
example  having  N  =  121  and  m  =  k3  .  For  Case  2,  the  singularities 
in  the  derivatives  of  the  eigenfunctions  near  the  corners  are  less 
troublesome,  and  the  value  of  the  higher  degree  polynomials  is  less 
pronounced,  although  still  apparent.  We  point  out  that  our  numbers  are 
upper  bounds  to  the  true  eigenvalues . 

Moler ' s  method  is  clearly  superior  if  accurate  upper  and  lower 
bounds  are  required,  or  if  approximations  to  many  eigenvalues  are  desired. 
However,  his  method  may  be  expensive  and/or  difficult  to  apply  to  problems 
whose  operators  do  not  have  simple  or  easily  generated  families  of 
particular  solutions. 

Moler' s  results  are  for  moderate  values  of  9  ,  and  only  for  the 
fixed  membrane  problem.  We  now  wish  to  make  some  comparisons  with  the 
results  of  Stadter  [s4]  and  Birkhoff  and  Fix  [B7] •  They  report  bounds 
for  9  =  30°  (15°)  75°  for  the  rhombus  fixed  at  all  edges,  and  Stadter 
reports  bounds  for  the  rhombus  fixed  at  two  opposite  edges  and  free  on 
the  remaining  two  edges. 

We  begin  with  the  fixed  membrane  problem.  The  bounds  reported  are 
for  eigenvalues  corresponding  to  eigenfunctions  which  are  symmetric  with 
respect  to  both  diagonals.  For  purposes  of  comparison,  we  restricted 
our  first  experiment  correspondingly.  Our  domain  is  the  hatched  area 


shown  below: 


Figure  3.1 


The  boundary  condition  un  =  0  is  imposed  along  the  boundary  of 

the  hatched  region  interior  to  the  rhombus.  This  is  easy  to  do  since 

it  is  a  natural  boundary  condition. 

A  B 

In  the  discussion  below  X^  and  X^  are  computed  eigenvalues 

reported  by  Birkhoff  and  Fi:--  [E7].  The  values  X^  were  obtained  using 

the  Rayleigh-Ritz  method  with  the  approximating  space  spanned  by  the 

affine  transforms  of  the  first  30  eigenfunctions  of  the  square  membrane. 

B 

The  values  X^  were  obtained  using  a  second  space  of  dimension  30  which 

included  special  singular  functions  having  the  appropriate  behaviour  at 

the  corners.  In  both  cases,  only  the  even-even  symmetry  class  was  sought. 

U  k  L  k 

The  eigenvalues  X^;  and  X  ;  below  are  upper  and  lower  bounds 

supplied  by  the  method  of  intermediate  problems  as  applied  to  the 

rhombical  membrane  problem  by  Stadter  [ S4] .  The  superscript  k  indicates 

the  number  of  intermediate  problems  used,  and  is  the  size  of  the  two  (dense) 

matrix  eigenvalue  problems  which  must  be  solved  to  obtain  the  bounds. 

O  o 

In  [s4]  Stadter  reports  bounds  for  k  =  15  and  9  =  30  (15  )  75  >  and 
in  [Wl]  he  reports  bounds  for  k  =  30  and  9  =  45 


I  k 

The  eigenvalues  \  *  *  below  are  our  finite  element  results  for 
piecewise  polynomials  of  degree  i  yielding  k  by  k  (band)  eigenvalue 
problems . 

In  Table  6  we  compare  k^  ,  \->3  ^  ^  and  ^  .  For  the 

r  n  n  n  n 

B  5  55 

following  reasons,  we  feel  it  is  fair  to  compare  k"  with  even 

g 

though  the  space  which  produced  XT  is  only  of  dimension  30.  First, 

Birkhoff  and  Fix  report  that  2^  times  as  much  computer  time  was  needed  to 
find  the  k^'s  as  the  k^'s  for  a  given  angle  9  .  (We  assume  this  increase 
was  caused  by  complications  introduced  by  the  incorporation  of  the  singular 
functions.)  Second,  our  eigensystems  have  band  structure,  whereas  theirs 
are  dense.  A  tnird  somewhat  qualitative  reason  might  be  termed  the 
"nuisance  factor".  All  our  finite  element  computations  were  done  with  a 
general  purpose  program;  no  special  modifications  with  regard  to  special 
basis  functions  or  geometry  were  necessary. 

The  time  required  to  generate  the  finite  element  eigenproblem  for 
each  angle  on  the  IBM  360/91  was  about  0.6  seconds.  About  0.2  seconds 
were  required  to  find  each  eigenvalue  using  inverse  iteration.  By  comparison, 

T) 

2  minutes  were  required  on  an  IBM  7094  to  produce  the  k^'s  for  a  given  9  . 
[Since  it  appears  that  the  major  portion  of  the  time  used  was  for  the 
generation  of  the  eigenproblem  rather  than  its  solution,  the  fact  that 
Birkhoff  and  Fix  used  a  method  yielding  all  the  eigenvalues  of  the  discrete 
problem  is  relatively  unimportant . ]  Roughly  4  seconds  of  IBM  360/91  time 
was  required  to  produce  the  upper  and  lower  bounds  (k^J  ^  and  k^^) 
for  each  angle  using  the  method  of  intermediate  problems.  The  results  are 
tabulated  below  for  n  =  1,2,  ...,6  and  9  =  30°  ,  45°  ,  60°  and  7jf  . 
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Symmetric  Eigenvalues  for  the  Fixed  Rhomb ical  Domain 


n 

x5>55 

r. 

xB 

n 

n 

xu'»> 

n 

Case  1:  9  =  30° 

1 

2.5228 

2.5238 

2.5224 

2.5241 

2 

8.4939 

8.5060 

8.4916 

8.5008 

3 

14.233 

I4.256 

14.224 

14.261 

4 

17.156 

17.183 

17.139 

17.167 

5 

27.173 

27.IIO 

26.983 

27.096 

6 

29.606 

29.620 

29.433 

29.537 

Case  2.  0  =  45° 

1 

5.5210 

3.5210 

3.5201 

3.5263 

2 

10.158 

10.190 

10.154 

10.173 

3 

18,785 

18.864 

18.737 

18.802 

4 

22.115 

22.135 

22.095 

22.214 

5 

30.153 

30.289 

29.785 

29.942 

6 

39-663 

39.582 

59.493 

39.777 

Case  5:  0  =  6o° 


1 

6.3238 

6.3598 

6.3217 

6.3485 

2 

14.968 

15.088 

14.958 

15.005 

3 

25.333 

25.571 

25.202 

25.338 

4 

38.064 

38.981 

37.436 

37.774 

5 

43.581 

43.717 

43.480 

44.013 

6 

54.267 

56.379 

51.883 

52.575 

Case  4:  9  =  75° 


1 

20.194 

20.283 

20.185 

20.407 

2 

36.373 

36.452 

36.301 

36.617 

3 

53.596 

53.562 

52.79*+ 

53.499 

4 

76.746 

80.125 

70.951 

72.660 

5 

110.20 

Hl.52 

90.964 

94.982 

6 

154.89 

144.38 

112.87 

121.75 

Table  5»3 
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We  offer  the  following  observations: 


(1)  The  remarks  of  Birkhoff  and  Fix  suggesting  that  their  Rayleigh-Ritz 
methods  yield  much  more  accurate  upper  bounds  than  the  method  of 
intermediate  problems  seems  to  be  barely  justifiable.  In  [B7]  their 

comparisons  of  X^  and  X^  are  against  X^*^  for  9  =  30  ,  6o° 

q  U  30 

and  75°  .  For  9  =  45  the  comparison  is  against  X^  ,  and  for 

A 

this  case  X^  was  a  sharper  upper  bound  in  only  half  of  the  cases, 
and  although  X^  was  better  in  all  cases,  it  was  only  marginally 
better  in  most  of  them. 

(2)  The  upper  bounds  produced  by  the  finite  element  method  appear  to  be 

g 

fully  competitive  with  the  X^'s  ,  and  are  appreciably  better  for  the 
lower  eigenvalues. 

(3)  Experiments  with  polynomials  of  various  degrees  again  indicate  that 
efficiency  increases  with  increasing  polynomial  degree. 

(4)  Our  finite  element  solutions  made  no  use  of 

(a)  information  about  the  behavior  of  the  solution  near  the  corners 
of  the  domain 

(b)  the  fact  that  the  domain  is  affinely  equivalent  to  one  in  which 
the  eigenproblem  can  be  solved  exactly. 

We  feel  that  these  points  are  important  because  the  utilization  of 
(a)  appears  to  be  awkward  in  i  general  implementation,  and  (b)  places  a 
rather  severe  restriction  on  the  application  of  the  method  of  intermediate 
problems . 

We  now  turn  briefly  to  the  fixed-free  rhombical  membrane  eigenvalue 
problem.  Gtadter  [S4]  restricted  his  attention  to  eigenvalues  corresponding 
to  eigenfunctions  symmetric  with  respect  to  the  center  of  the  rhombus.  It 
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was  not  convenient  for  us  to  restrict  our  problem  correspondingly,  so 

4  15  4  lc 

we  solved  the  "full"  problem.  We  report  results  for  X^  and  '  . 


and  X 


4,15 

5 


for  the  Fixed-Free  Rhombical  Membrane 


9 

^,15 

X1 

\L>  15 

X1 

XU,!5 

,*,15 

b 

,L,15 

P 

m 

1.2545 

1.1820 

2.8550 

4.9105 

4.6585 

5.15^7 

2.8550 

2.5046 

5.6553 

7.6455 

6.988I 

9-5382 

H 

8.5400 

6.8058 

15 .045 

19-177 

14.255 

27.458 

Again,  with  a  moderate  number  of  parameters  we  can  easily  improve 


on  the  upper  bounds  produced  by  the  method  of  intermediate  problems. 


4.  A  Dirichlet  Problem 


We  now  consider  finite  element  solutions  to  the  following  problem: 


wX  ay 


(^.i) 


u  =  e  cos  y  on  S  U  T 


The  boundary  S  is  a  1.28x1.28  square  and  T  is  a  .25  X  .25 
square  with  lower  left  corners  at  (0,0)  and  (.5>»5)  respectively. 
The  inpit  mesh  is  indicated  by  the  dashed  lines  in  the  diagram  above. 
Experiments  were  run  on  an  IBM  360/91. 

Details  of  the  various  polynomials  can  be  found  in  Appendix  A.  As 
before,  N  indicates  the  number  of  finite  element  equations  and  k  is 
the  factor  by  which  the  input  mesh  was  subdivided.  The  profile  Cholesky 
algorithm  and  the  RCM  ordering  (see  Chapter  4)  were  employed  in  all 
cases. 

We  begin  by  comparing  different  element/mesh  combinations  which 
yielded  roughly  the  same  accuracy. 
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Element 

N 

k 

Set-up 

Time 

(Seconds) 

Solution 

Time 

(Seconds) 

Operations 

for 

Solution 

x  (io+5) 

Error 

x(io“5) 

Storage 

L  A 

2-6 

540 

5 

3.18 

•53 

154 

1,86 

10, 544 

3000 

3-4 

216 

3 

1.56 

.22 

37 

1.28 

3,099 

1620 

3-10 

l8o 

2 

1.13 

.13 

28 

1.63 

2,425 

1188 

4-15 

72 

1 

.65 

.04 

7 

1.14 

684 

468 

Table  4.1 

Table  4.1  demonstrates  dramatically  the  value  of  using  high  degree 
polynomials  for  solving  this  problem.  Set-up  times,  solution  times  and 
storage  requirements  decrease  as  the  degree  d  of  the  polynomial 
increases.  Observe  the  striking  decrease  in  the  operations  required  to 
solve  the  generated  linear  system. 

We  now  present  some  experiments  using  the  initial  mesh  and  varying 
the  degree. 


Operations 

for 


Element 

N 

Set-up 

Time 

Solution 

Time 

Solution 

(x  105) 

Error 

Storage 

L  A 

3-10 

36 

■  .482 

— 

2 

2.35(-4) 

209 

144 

4-15 

72 

.65 

.04 

6 

l.l4(-5) 

684 

468 

5-21 

120 

1.10 

.13 

21 

4.o6(-7) 

1692 

ll4o 

6-28 

180 

1.86 

.20 

53 

1.16 (-8) 

3465 

2340 

Table  4.2 


Again  the  case  for  higher  degree  polynomials  is  apparent.  Compare,  for 
example,  the  third  entries  in  Tables  4.1  and  4.2.  Their  demands  on 
system  resources  are  about  the  same,  but  the  error  for  the  quintic 
is  more  than  an  order  of 'magnitude  less. 

To  compare  the  above  results  with  what  could  be  expected  using 
finite  difference  methods  we  solved  the  problem  using  the  standard 
five-point  difference  operator  on  a  uniform  square  mesh  with  mesh  width 
of  l/lOO  .  The  solution  was  obtained  using  an  imbedding  approach  [G1,B15] 
which  makes  use  of  very  fast  direct  methods  for  solving  the  discrete 
Laplacian  equations  on  a  rectangular  domain.  The  set-up  time  for  this 
procedure  is  large  («  25  seconds  for  our  problem  on  the  IBM  360/91)  and 
consists  of  computing  a  qxq  "capacitance  matrix".  In  our  problem 
q  =  100  and  the  computation  of  the  capacitance  matrix  involves  solving 
q  127x127  rectangular  problems.  However,  once  this  initialization  is 
done,  we  can  obtain  a  solution  to  our  given  problem  by  solving  2  rectangular 
problems  and  a  dense  q  by  q  system  of  linear  equations.  Assuming  that 
we  have  computed  and  decomposed  the  capacitance  matrix  beforehand,  we  can 
solve  our  problem  in  about  .7  seconds.  This  latter  "solution  time" 
has  been  found  to  be  superior  to  SOR  or  ADI  solution  times  (by  factors 
of  5  to  8)  for  a  number  of  typical  problems  [BIJl- 

Thus,  a  (conservative)  entry  in  Table  k.2  for  finite  differences 
would  be 


N 

Solution  Time 

Operations 

Error 

Storage 

15.504 

•7 

106 

7  •  10-6 

22000 
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Each  solution  of  the  127  x  127  rectangular  problem  requires  about 
O  seconds  on  the  IBM  36o/S'l*  Thus,  even  using  the  iterative  scheme 
(based  on  fast  direct  methods)  proposed  by  George  [Gl]  which  avoids  '-ue 
calculation  of  the  capacitance  matrix  is  unlikely  to  compare  favorably 
in  overall  time  (solution  and  set-up  time)  with  the  last  entry  in 
Table  k.2.  Anyway,  an  equally  important  consideration  is  storage 

requirements,  and  the  last  entry  in  Table  h,2  requires  only  5805  words. 

,  -8 

The  observed  error  for  the  sixth  degree  polynomial  was  1.16x10 
compared  to  7  X  10  ^  for  the  difference  equations . 

Again  we  should  point  out  that  there  are  still  better  ways  to  solve 
this  problem  if  we  are  prepared  to  take  advantage  of  its  particular  characte 
istics-  Moler  (private  communication)  solved  the  problem  by  using  a  linear 
combination  of  particular  solutions  as  a  trial  solution  and  determining  the 
coefficients  of  the  expansion  by  minimizing  the  two-norm  of  the  error  at  a 
discrete  set  of  points  on  the  boundary  Sl)T.  The  least  squares  solution  of 
a  26  X  15  problem  was  all  that  was  required  and  the  program  was  only  a  few 
pages  long;  the  error,  however,  was  around  10  ^  . 
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.-ipp--iidi  x  A  : 


Si":-  Representative  Triangular  Elements 


The  labels  on  the  stencils  below  indicate  the  parameters  associated 
with  each  node.  When  no  label  appears,  the  function  value  v  is  to  be 
asocmed.  Tne  two-part  hyphenated  name  refers  respectively  to  the  degree 
>_f  tne  polynomial  and  the  number  of  nodes  associated  with  the  element. 


Element  Name- 


Stencil 


0/S  360  Fortran  Code  for  Finite  Element  Methods 


Appendix  B: 


The  codes  '-n  this  appendix  are  all  written  in  0/s  36O  Fortran.  There 
are  five  separate  programs  whose  relation  is  depicted  in  the  following  chart 


The  transmission  of  data  from  one  program  to  the  next  is  done  via  Fortran 
units  1,  2  (and  3  if  an  eigenvalue  problem  must  be  solved)-  All  data  sets  read 
and  written  are  sequential,,  so  the  program  would  work  without  alteration  whether 
the  storage  devices  are  disks,  drums,  or  tape  units.  Only  changes  in  the  job 
control  language  would  be  necessary. 

The  program  is  set  up  to  find  the  stationary  values  of  I^v]  +  I  [v]  ana 
[ v ]/j[ v ] ,  where 

IqM  =  If  civx  +  C2Vy  +  e3y2 

R 

I2[v]  =  ||  c^v  dx  dy, 

R 

and 

J[v]  =  ||  v^dxay 
R 

Here  c^,  c^,  ana  c^  are  constants,  and  c^  is  a  function  supplied  by  the 
user  in  the  subroutine  FUNC.  For  further  details  and  sample  input  see  the 
comments  in  the  code  of  PHASE  2  and  in  Appendix  C. 

With  minor  changes  in  the  mainline  of  PHASE  2,  other  terras  can  be  included 
in  1^  and  Ig,  and  with  somewhat  more  substantial  changes  variable  coefficient 
quadratic  terms  could  be  handled.  Note  that  phases  1  and  3  would  not  need  to  be 
altered. 

Piecewise  polynomials  of  degree  d  (l  <  d  <  9)  utilizing  (^  )  value 
and  first-derivative  parameters  can  be  selected  by  the  user  and  are  automatically 
generated  by  the  program. 

The  choice  of  method  for  solving  the  generalized  eigenvalue  problem  depends 
on  the  relative  size  of  the  number  of  equations  and  the  bandwidth,  as  discussed 
in  section  k-3-  Both  programs  assume  that  the  initial  shift  (SHIFT)  supplied 
by  the  user  is  a  good  one;  the  decomposition  of  A-SHIFT*B  is  done  only  once 
at  the  beginning  of  the  iteration. 


165 


Phase  1 


Generation  of  the 


Mesh  and  Ordering 


of  the  Nodes 


a.  *- 
c  - 
Li  ct 
Li  U i 


uj  a 

a  u 

O  Li 


a  z 

UJ  Ui 

»-  X 
z  < 


cr  «o 
c  • 
o  o 


O  *N* 

2 

as  - 


u  — 

X 
a.  uj 

Ui 

aj  ct. 


U  — 


-J  <t 

-c  a 
z  a 


•  i\i  t*t 

•t  in  m 


:  <© 

-o  • 


w  -  w  l/>  — ■  • 


h  •—  Ui 

-  3 
►-  f\.  2 

s  ~ 

ro  Ui  *- 

a  z 

c  o 


%/t  l/»  «■  » 

a.  *-  —  ia 

o:  2  o  »-  » 

«q  O  *  r» 

C  -*  r- 

a  »- 


i  L  L  i 
J  U  U  4 
:  2  2 


>C  »  -C  «- 
- 

Ui  —  u.  o 

>-  O  *-  7 

— .  -a  —  a. 
a  ai  a.  O 
-*  a  -*  a 


O 

2 


•  2  3 

>  <X  2 

r  u/  — 

>  o  *- 

2 

U 


oca 
c.  o 
o  o 


.  a.  a  • 

I  N  It 


>7  *7 

i  ri  rn 


O  O 

•  * 
—  O 


U  WWW 


o  o  c 


INiH  •>* 


H  —  —  •©  — 


in  »-«  r\»  r*  — ■ 


ant-  uuu 


.  o  a  * 

,CU( 

:  2  2  . 


uj  Nnn~*rum« 

»-  ■ -  - ’«<>o  «  a 

<j  rxj  ro  — - - a  a  a  -  it  u  >- 

.  ^  fy  (O  u  iu  Oj  uj  li  i/t 

.  w  w  w  r  2  2  *-  >-ut  J 

:uuc,u  a  a  —  >*-*>  o  -j 

,22  2U<_JU££.U.OCO< 

:cua:  at  ooojco22XU> 


.  a  a  « 
s  c.  o  c 
j  u  l_  . 
:  2  2  < 


o  o  o  uuu 


2  a 


•  “  o 

2  2  V 

U  UJ  N 

u  u  * 
■*  z  >f 


•  c 

l/> 

>  cr 


*oi- 
u  JO 
X  C 
7  - 


w  Vi*  ", 

nj  -  k-  ; 

i  o  a 


z  u-  v 

—  a 


—  >.  L 
V  u  2 
;*  > 


7  y  > 

i—  u,  u 
U  Ui  w 


U  a  r  a 


in  ►- 
*  2 


2  •■ 
r  i/i 
>  ►- 
*  —  Cl 

v,  C 
2  \ 


r*.  * 


.  v 


vl  <  C 

►-  c  c. 

*x  V  \ 


—*  m 


7  OL 
t-  V» 


k-  w  <  *-  O 


C  O  M  «*«  Ui 


•t-  N  M  1 


U  —  t-  < 


a  ~x  i 
►-  u  « 
in  2  • 


7  a  7  2  a  O 


3  v.  u 
.  a  a.  i_’ 
—  2  2 


?  ft 
Z  Z 


VC.  V 

^  —  a  < 

►-  v  l  -> 


u  —  ■ 

’•”11 


uu  uuu 


168 


0 


Ll 


fl 


uj 


3 

L 


BEGIN  PAIN  ORDERING  ICC? 


OHiSEl:  (TRISNOULMICN 


Generation  of  tie  Equations 


COMPUTE  SOME  CONSTANTS  for  storage  maps, 


11 

s*“ 

l; 


a.  kj 
•  z  — . 

*k  -J  < 

rsi  w  _j  — 


j  N  \  J  N  M 


— .  o 
u.  o 


O'  X  — 
UJ  v> 

Z  a 


«NJ  Trt 
X 


O  J  —  U»  *NJ 
Z  O  x,  Q 

~  MSI  Z  ► 


C  K» 

M  J\l  f\J  UJ  K 

Z  Z  Z  +J  < 
>-  J-  3  X 

♦  ^  ♦  I/)  u. 


;z.^ 
*  N  Z  Z 
X  m  n 

O  M  CD 
:  —  Z  Z 


4  o  *-»  a  ♦  : 

Z  -J  >  <M 

l/l  a.  33  H  X  7  2  ' 


C/1  •  — 


O  t/1  wl  h -(  tNJ  «  i/'  -T  >—  O 

- *  — - #  CJ  >  • 

•  O  O  V9  ^  33 

I  O  kj 


,  Z  X  3  ~  ~  • 


H  O 

z 

wi  Q. 

a  z 


—♦  UJ 

mi  Q- 
=  > 


M  N 

i  <  CO 


mi  O 
I  M  UJ 


VI  w 

X 

Q£  U. 


:  3  * 
.  •  c 

*  o " 


a  O  u.  O  o 


II  M  M  3  -0  •—  Wl 

~  ~  t-  < 

—  <—  KJ  UJ  <  _J 

<7  cr>  ►-  m  »-  r  ~ 

• - v>  —  oc 

O  O  *1  h  i  C  H- 

J  J  JU.  ja.« 


o 


o 

o 


o  o  o 


~  ~  a 


•*  *-  _j 


•>  »  o  o  »  ’■* 

XT  T  *  X 

— •  —  •  —  o  — 

z  a  vuo  o 

mm  —  w  UJ  w 

z  z 

•  UJ 

o 

3 


uj  uj  uj  O  uj 


r 
< 

3 

n 

_  a: 

'Z  ^  a. 


~  o 


CL 

z 


o 


o  -  — 

H  X  < 
UJ  -J 

o  a  — 
3Zn 

33  *- 
—  J  * 
Z 

•**  ®*  kj 
T.  O  - 
v>  D  *  y 
a  «j  — 

►-  »-  w  O 

z  a  *- 
•  <  -» 

•  Oi  5. 
N  ON 


<oa.  «j  < 
—  o  o 
o 
o 


U  U  3  O  <3 


1 

I 

1 

KJ 

1 

U» 

1 

1 

o 

1 

1 

VI 

1 

i  i- 

o 

v> 

1— 

i 

l 

<T 

u; 

1  » 

* 

3. 

* 

i 

0 

o 

1 

X 

> 

<v< 

i 

• 

t 

UJ 

UJ 

UJ 

3 

( 

• 

V*. 

a: 

1  • 

-J 

V) 

Z 

u 

• 

>- 

1  N  ►— 

a 

( 

• 

< 

t 

z 

O 

l 

KJ 

r 

1  U  O 

C30 

a 

UJ 

i 

VI 

1  VI  Z 

O 

3 

VI 

O 

l 

UJ 

a 

v> 

1  < 

<£ 

z 

MM 

» 

3 

o 

VI 

I  X  <s 

c 

* 

a 

MM 

» 

mj 

u. 

u 

k  CL 

3 

r 

mm 

w 

i 

< 

z 

1 

UJ 

i 

> 

• 

MM 

MM 

(3 

i 

> 

t  a 

-3 

■ 

• 

a 

LLl 

1 

fn 

3 

t 

o 

u 

i  c 

o 

c 

a 

M- 

CO 

* 

V- 

MM 

w 

1  u. 

o 

» 

J  ’ 

'U 

MM 

> 

_J 

M* 

k— 

< 

H 

* 

II 

O  v> 

m4 

o 

• 

i 

3 

< 

v> 

i 

^  C 

1 

< 

z 

MM 

* 

13  1 

< 

or 

3 

< 

a 

u. 

< 

vi 

1  <3 

rst 

vu 

MM 

u 

v> 

M 

VI 

■u  l 

OJ 

mJ 

< 

1  Q 

1 

Ml 

MM 

CL 

IM 

O 

L> 

1 

« 

(O 

*• 

T 

J 

1 

UJ 

*• 

.« . 

* 

V 

» 

c 

-* 

► 

C 

3 

l  3 

a 

« 

-J 

Ml 

a 

o 

IU 

Q 

VI  1 

r— 

J  3. 

• 

<mi 

< 

V 

L- 

z 

VJ 

X  » 

VI 

Uj 

j: 

<  Z 

u 

— 

3 

UJ 

z 

u 

o 

1“ 

sO 

Z  ac  » 

X 

|  MI 

o 

< 

u 

or 

• 

MM 

UJ 

—  i 

a 

i/' 

1 

• 

• 

3 

•• 

U' 

-t 

1  L. 

X 

II 

m* 

» 

m4 

V) 

M 

•X 

3 

-J 

a 

►  1 

►*• 

u. 

3 

Ol 

P 

X 

Cf 

Vi 

mJ 

o  ^  1 

UJ 

VI 

« 

O 

UJ 

rvj 

z 

X 

u 

•* 

MM 

*•  1 

X 

jj 

1 

z 

z 

Z 

V 

r 

M 

z 

a 

z 

1 ' 

K. 

3  | 

< 

3 

u- 

1  UJ 

c 

Uj 

» 

mi 

»— 

(M 

*— 

*■ 

of 

• 

1 

CL 

Uj 

|  •/' 

M 

a 

* 

w 

•• 

M 

a. 

< 

a 

« 

1 

W 

3. 

ii 

H 

> 

>/l 

» 

2 

I- 

— 

X 

o 

<v  l 

C. 

mJ 

►— 

Q 

I  3 

•; 

< 

O 

M 

fsi 

V- 

o 

P” 

M-- 

“• 

z 

1- 

< 

u 

1  U 

mi 

UJ 

fS 

4 

o. 

►“ 

LL 

-» 

T 

[• 

1  o 

UJ 

C 

a 

z 

? 

v> 

M  L  > 

K. 

1 

C 

< 

< 

vS 

1 

'L 

►- 

K.  ^ 

•V 

VI 

l 

VI 

cl 

i  a 

V> 

M 

» 

O  x 

N» 

W 

O  MM 

r 

1 

z 

UJ 

</> 

»  > 

UJ 

MM 

c 

o 

u- 

o 

_j 

K 

K.  w 

«-« 

■*** 

Of 

1 

o 

x 

< 

-» 

■j 

u 

i- 

UJ 

UJ 

C  N  C  -T 

l 

o 

T 

UJ 

• 

k— 

>- 

o  v' 

j; 

a. 

X 

► 

M 

Z 

— 

l 

1- 

p  * 

C 

O 

1  <5 

r* 

MJ 

MM 

VI 

VI 

MM 

•o 

V 

K» 

1 

•t 

(l 

Cj 

1 

VC 

'J 

T 

a 

s 

u. 

a 

a 

1— 

<M  mm 

1 

KJ 

rv 

3 

z 

1  l/c  UJ 

2 

►“ 

N 

■N. 

-j 

K. 

»  « 

» 

O 

U» 

( 

*-• 

a 

_J 

IU 

1  — 

MM 

— 

a  x 

a 

v» 

1 

mJ 

X 

_j 

1 

u 

_l 

VI 

o 

Z 

z 

Z 

uj  uj  u>  or 

9 

*t 

» 

v> 

►- 

1  t 

T 

n 

ri 

r* 

O  j 

• 

_J 

1 

MM 

3 

i  r-»  o 

a 

-J 

> 

r 

T 

UJ  (V 

UJ 

mi 

UJ 

1 

»— 

KJ 

KJ 

l 

UJ 

Uv 

4 

T 

T 

*-  j 

►- 

< 

T 

t 

MM 

X 

X 

j: 

<  V 

a 

1 

a 

r 

3  n 

C 

z  o 

Z  UJ 

< 

1 

z 

< 

< 

1 

1 

1 

1 

v> 

o 

<wl 

O 

c- 

« 

O 

1 

• 

*  1 

1 

1 

7. 

c 

c 

c 

1 

O  U  sj  u 

C 

C 

c 

UJ 

o 

o 

1 

1 

u  a  o  o 

o  o 

of 

-»  o 
•  z 


3  O  ^  -J 
az»- 
Z  MJ  o  • 
-  *  r 

•  v»  *  * 


x  a 
•—  ►- 

U-  <7 


>  «  *  ° 
•  I-  at  3 
n/  Z  Hi 


L 

i; 

L. 


*.  x  : 
x  o  : 


a  so  • 

-J  3  ' 
-J  <  * 
<  uj  i 
O  X  ( 


178 


I 


,5a 

'I 

M 


o  < 
—  uj  X 

7  UJ  UJ  J 

o  x  r  u» 

O  O  k~  o 
UJ 

-1  O  Z  Uj 

—  z 
✓»  r  — 

u.  «-  C  »— 
^  I  y  3 
3ff  K3 
3  a: 


♦  uui*- 
at  cc 
QwO 


►*  DSTa 
Iu-jj 
o  o  o  r 
—  <  a.  h- 
a 

<  z  o 

y  jCj 
z  x  -  ^ 
►-  r 

SL  <  -J 

C  ^  a. 
iu  u.  .j 
t-  <  uj 

<y  >& 

nr  n  w  < 
7  ►-  wQ 

u-  <  X  z 

Og-< 

ai  at  •  t/> 
7  t-  Wl  »- 


C  % 
7  wtT>»- 

•-  M  1/1  w 

o  t-  a.  ^ 

-  -  JU> 

-O  <  O 


—  W  J 

rsi  U)U 

— .  r 

>  ~  LL. 

-  -£ 

—  Z 

O  cC 

—  —  UJ 

>  »-  > 

o  o 


— 

<  4 

* 

«d 

» 

*-  —  <  > 

i 

~  >  U. 

x  a. 

i 

■f  -*  c*  >  J 

3 

UJ 

►—  Jt 

3  -» 

> 

« 

Z 

•  - 

r  » »- 

« 

3  —  UJ 

i  »- 

s  \  s 

-J  Z 

i 

#  #  *  u.  u  » 

UJ  || 

•*4 

w  UJ 

Z  tJ 

T>  «  C. 

z  —  uj 

* 

V  w 

CL  >0  — 

i 

n »-  z  ^ 

i  — 

i 

cc  X  cc  _/  1 

t-  <1 

^  3 

H 

N 

T. 

- - 

n 

* 

►- 

w  ►— 

—  >— 

z 

3 

l 

z  u  O 

1  o  z  z  z 

t 

UJ  U  O/  X  <  1 

<  >  — 

11 

_i  Z 

a 

ri  »st  uj  zz 

\n 

Z 

3 

U-.  < 

uj  < 

O 

1 

3  z  > 

i  ** 

0  3  0 

i 

o  J  O  *  >  i 

3  u* 

<  — 

S  GC 

z 

X 

> 

t-  t- 

rr  wo  — 

cC 

* 

C 

*-  r 

-  >-  X 

3 

a 

t 

ul  C  >  ■! 

i  -/ 

a;  r  x 

i 

uj  c*.  u  j  1 

>  >“ 

o  O  NUJt- 

O 

O  X  t- 

_J 

3 

# 

ac 

—  or 

—  a: 

►— 

o 

<£ 

» 

1  3l 

Y  Y  Y 

1 

»-  **•  >-  <  3  1 

<  UJ 

1— 

u.  ^ 

Z  u- 

•— 

»— 

o 

o  cc  r 

-J 

*-  O 

cc 

a.  n 

a  a 

uj 

z 

3 

i 

r  zca 

t  > 

O  CO 

> 

z  o  z  a  » 

>  X  O 

O 

o 

o 

< 

uj  Z 

# 

3 

Z  W 

1  U. 

Z 

w 

vl 

1 

»-  a.  a.  <x 

1  — 

O  O  O 

• 

—  O  ►-  a.  at  i 

U  K  O 

UJ 

o 

O 

a  u> 

* 

ts> 

• 

» 

1 

*  • 

i 

i 

« 

o 

1 

1 

1 

l 

m  i\t 

*■ 

c 

o 

1 

1 

» 

l 

• 

•* 

1 

t 

1 

1 

1 

1 

1 

» 

* 

) 

o 

o 

__ . 

o 

__ 

o  u  o  o  o  u 

_ _ _ 

_ _ 

o 

o  o  u  o  u 

_ _ 

O  O  O  LJ 

_ . 

_ _ _ 

_ _ 

IC 

i 

tc 

_ _ 

o 

_ 

I  a.  jj  «n  » 

o  c  ►  * 


»-<  Vi  ♦  . 

Of  <£ 


—  (\*  — 


x  ~  l> 

—  C  —  '+  — 

<r>  o  *  # 


!  «t  ^  —  «  *■* 

■  j  I  ^  — 

■  t-  a  T  -  *  u 

’  a  —  - 

•  _J  —  w  7 

j  H  «/>  </l  — 

<  u.  ■”  X  ►- 

1  'J  “  H  a  u  1? 

O 


M  It  t*  II 
C  X  -*  ^ 


uj  <  /v  «  n 

—  .«■  *-  c  »  y. 

cf  -  •  U</<  3 

Tt  a  f  at  t  >A 


\ 

\ 


—  X 

v.  Of 
O 

I 

* 

o 

=  8 

— «  VO  —  ► 


3 

C  O  i 
<  * 

o  * 


L  -»  —  3  <s> 

> 

J  H  w  "<  Q, 

r  q.  i/>  m  > 

>irtu 

l  WCCCC7 

z  — 

0  —  H  H  1-0 
-  U.  *-  UJ 

t.  u.  —  ~  a.  3 

>  U-  ^  *Ni  V-  2 

-  m 

ro>>M-3 


0 

30 

C  a: 

Of  Q- 

«s 

uj  m 

X 

►*  < 

a 

O  H* 

3 

^  -3 

O 

>  O 

a 

-■* 

N 

Ci 

0 

3 

* 

*e 

O 

c  0 

0. 

►— 

UJ  0 

a.  1 

Ui 

in  _j 

0 

O 

3  to 

z  — 
a  3 

11 

O 

m  z 

0  *  — 

X 

o 

z 

>• 


o  —  • 

«  #  It  • 


>  -  3 
‘  *  —  3 


♦-  g 

O  ** 

'T  Z  - 

►»  7 

z  — 

cl  fu  0 

O  »« 

-  CL  O 

0  ►- 

> 

Z 

z  c 

0  z 

UJQ 

O 

0 

0 

ZD 

O 

0 

0 

3  N 

m  4 

uj  -• 

-i 

tD  i/> 

<  ►- 
*-  z 

UJ 
ujx 
►-  T. 

<  a 

OC  » 


cn  -£  «-*  •  «•* 


•  K  M  —  — 


u 

tn 

O  nj 

n  — 

:1s 


O  —  —  . 

z. 


.  u 
• I-  3 
z  z 
o  — 
o  »- 
z 
o 


sr 

*  * 

H  I 

3 

N 

CO  8 


*  in  * 

I  Z  I 

*  e 


» lu  ui 
i  cl  3 
l  >-  Z 


-«  3 
3d  O 


■  3 

:  z 


>  •-  3 
Z  Z 


£  8 


e 

z 

—  IIJ 

O  <_J 

Z  ' 

3 

uj  — 

^  «  . 

Uj  — 

o 

Z  £ ' 

£  2 

£S 

>  O 


jHwUUJ 

jOaD 
N  C>  Z>  Z 

•  »-  a  »-  — 


o  u. 
uj  o 
c. 


<3  o 
o  UJ 

—  o 


■o  — 
— »- 
-J  UJ  < 

►-  X 
N  — •  CC 

*  O 

z  JC  u 


1  , 


C  ~ 
u  o  m 

(TV  -* 

•  —  l/> 
— .  u  ►- 
>  u  c. 

I  UJ  «n 
3  C 

•  o  * 

X 


I 


►  in 


iuwm 
Q  •  — 
-Ot 

z  — 


co  —  a  x 


<  a  * 

ui  >  ~ 
oc  uj  m 
*sl  — 


m  •  z 

2£  . 
z  OJ  o 
UJ 

^  .£ 

y  m 

CL  Z 


z  -  o 
I  o 

in  •> 


«/»  n  m  ►  j. 

nj  »-  h- 

k  a  i-  a  -j  •• 

<y  ►-  u.  z  z 

*  u. 

C  M  v  <t  v_> 

UJ  *U  c  -»  Z 
HSOUJ 
7  u  VI  K  »  » 

—  Z  —  —  in  on 

—  r.  q  a 

»-  V  >»  v.  j  z 

u  z  z  z 

—  r  o  n 

j  i  y  j 


u  u  u 


X 

a  — 


tUJHj 

—  C.  -> 

O  2  N 

•0  *\!  3 

4  M  --  s-  7 

«  M  >uj 

u  a  x  «j 

UJ  u.  UJ  t)  < 
w  C  _j  •  > 

uiuja  j-* 
»-  k  y  <  D 
24CJUJO 
»*  •“  *J  ft  uj 


•  UJ 


£2 
*  UJ 


in  o 
X  UJ 

«  c 


'-U  ~4 

o 

II 


A  H  «  M  V) 


—  ~  M  3 
x;  c  « 

«  ^  — 

X  w  w 

UJ  *  w 

-  C  X 
in  -  z  x  •*  r 

r  -«  —  3  a  x 

^  —  w  c  z  3 

<  —  a  c 

a  Ki  *«-o“ 

>c  —  —  —  — 


w*  3  C  O  C.  C 
—<<<<< 
O’  uj  UJ  UJ  UJ  UJ 

I4yj.ua 


2 

2 


3 

Of 

O 


< 

Of 

O 


c 

I- 

D 


m  ~h 
X  0  u. 
3  ru 
Z  M 


rvJ  • 


z 

uj  ui 


at  ui 
O  »s| 


o 

o 

o  o 


INI 

“  "*  «u  *  — 

■  —  u.  -J  m  —  in 


-  2! 


O'  —  f\J 
*3  w 
U  » 

—  —  ^ 

C  r  -4  «  — 

0*0  Cr 
— •  — •  *.  O' 

X 

<C  w  «  W  <0 


•to  —  —  >■ 


UI  <  UJ  <  UJ 

**  a  *“  f£  ” 

CC  3  a  Cj  <X 
^  U  -«  U  s 


3  ft. « 
Z  ^ 

<  Z  ■ 


h  --  -n  — 

-  — «  w  >-  U 

—  —  UJ  <  3 

v-  11  «  h  r  ? 
•J  •“  *■•  Lk.  *** 

T  nJ  ft  3  ►— 


O  o 
>  o  ^ 


£8 


gouo 


000 


a  < 

O  3 


y  0 
c  z 

o  «i 


0000 


3 
O  - 


—  —  3 
■1/11/17 
XX  — 
‘  a  a.  i* 

7 

J  o 


181 


r? 


r 

►- 

o 

</i  • 

X 

z  < 

C  at 

r  o 
r  o 

a c 
O  a. 

Z  u> 

*-  Z 

C 

Ui  Z 
a  — 

w  < 

►-  x 

VI 


—  —  <  to 
‘t* 


Z  uj 
»-  to 

3 


a  —  cc  t 
o  o 

•»  C 

•  W  O  . 

—  u_ 

>  a-  • 
i  u  j  a.  « 
O  C 
-  L-  * 
Z  U 
»  *■  < 

<  —  3 

—  O  u 

sf\ 

fC  —  M 

*  v»  m 

— f — i 

<  cl  > 


3  -  — 

—  O  a 
i/>  «  g 
Z  —  — 


5  -2 

CC  O 

>•  w  » 

Z  U>  o 


:  a,  v; 

»-  X  * 
-  Z  a  Z 


k-  *4 

Ui  to  — 

OnX 

*-  SN 
7*-  1 
—  Z  < 


O  Z  Z 
—  o  o 

->  z  X 
ary 

5SS 


<s\  -  y 
*-  u 
a.  «g  * 
7  Z 

»  UJ 

-V  <  u 
C  j  z 

ui 

x  .  » 
—  CO  «A 
O.  ft.  a 

O 

X 

T 

C 


—  o  — 

♦M  Z 

—  —  «U 
O  NO 

**  **  z 

•  •  3  Hi 
K  -j 

Ui  UJ  CO  -4 
.J  ^>  •  > 

a-u  j- 

I  H  <  ^ 

O  Z  UJ  o 
o  —  cc  tv 


099 

V 

N 

u. 

t 

o 

O  ti» 

1 

*4  »- 

OC  •«. 

•  a 

Ui  >< 

x  > 

X  « 

Ui  UJ  ^ 

o  u. 

-J  Z  o 

• 

a  u 

CL  ^4 

• 

X  *  — 

• 

o  z 

o  o  o. 

>- 

a 

V 

z  — 

U  3  to 

c 

< 

eg 

< 

a  z 

X 

X 

z 

rO 

♦ 

•  ac 

_  •>  * 

X 

Ui 

X  Ui 

> 

o 

X 

O 

*- 

1  UJOjO 

►- 

JO 

u. 

3  0-^ 

• 

X 

O  z 

•  —  o  a 

> 

a 

u. 

X  _  o 

a 

UJ 

99, 

O 

♦ 

(X  t 

1  z  —  « 

Q 

N 

UJ  *- 

<  a. 

z 

X 

V  »vi  •■ 

X 

3 

3 

> 

eg 

O  uj 

—  —  <x 

o 

X 

> 

a.  x 

CC  X  o 
g  -z 

«/> 

♦ 

►- 

• 

*»• 

• 

*  • 

ooco 

>- 

CL 

eg 

X 

*-  u.  > 

<  ►“  —  <* 

k- 

ft- 

►4 

eg 

X 

Z  O  J 

UJ  7  UJ  C 

M4  •■< 

V) 

a 

— 

> 

U«  UJ 

re.  ~  UJ 

« 

• 

K»  fti 

*4 

> 

X 

—  UJ  > 

*  o 

X 

• 

> 

o  >  — 

►  to  ►  — 

t  1 

rj 

u. 

• 

c 

•• 

♦ 

—  —  k— 

—  0/  v> 

C 

«■■* 

-J 

u.  ►*  o 

Zt-I  » 

► 

*-» 

O 

VI 

o4 

<3 

» 

< 

U-  <  u. 

1  Z  CL  Z 

« 

ft- 

*_ 

X 

rg 

U‘  >  Q. 

«— i 

rg  f'l 

UJ 

3 

X 

o 

r 

X 

c  •-  ^ 

_  »  ► 

♦ 

♦  ♦ 

o 

O 

o 

Onruj 

Vi  •>  v 

> 

—  ••< 

o 

< 

to 

♦ 

1 

z 

♦ 

uj  a: 

NL1J 

X 

a  a 

X 

ft- 

0-* 

> 

—  o 

*  X  -I  • 

« 

ft-  ft- 

•• 

mm 

Q. 

-m 

•-4 

> 

_i 

z  — 

oc  Z  Z 

» 

k- 

o 

> 

K> 

o 

o 

o 

> 

U.  *>UJ 

—• 

*— 

UJ 

Ui 

> 

O 

-1 

<  o 

a 

H  w 

Ox  <(J 

ri 

>- 

X  V 

• 

X 

Z  vi 

h 

< 

3 

3 

< 

UJ 

tO  |  >. 

UI3  J2 

W 

X 

Ui  Ui 

a 

ft- 

• 

3 

U. 

u. 

3 

o 

Lb 

♦ 

Ui  >-  ^ 

k-  UJ 

X  — - 

UJ 

C  3  u 

a 

U. 

a. 

X 

O  C 

X  *.  ► 

X  -* 

s 

z  z 

• 

UJ 

•4 

»— 

-4 

eg 

\ 

Ui 

V- 

<  1  3 

—  —  V)  CO 

z 

CL 

to 

«■« 

X 

X  .* 

u.  a  x 

eg  u. 

CO 

< 

ft 

N  l/l  UJ 

X 

tf\ 

M 

'y 

oc 

a  3  < 

^  -g  z 

•  o 

*4  *4 

z 

ft-  3 

i3 

eu 

V- 

u> 

a 

K* 

aL 

to 

N 

CL  Z 

ft 

X 

> 

•-• 

z 

* 

o 

o 

♦ 

~4  UJ  • 

o  z 

Ui  CO 

n  a 

mm 

ft- 

>‘*"» 

>- 

1 

1 

X 

CL 

c 

>  X  ~ 

o  • 

1 

w 

a 

«-4 

*4 

Ui^- 

o 

'3 

QC 

cO 

«  K  “• 

-1  X 

UJ  u 

eg  m 

> 

k- 

•u  7 

< 

li 

ft 

ft 

N 

a 

co 

z 

a:  — 

a.  r 

V-  < 

•-4 

•-«  — 

u- 

Ui 

DO 

O 

3 

Ui  Z 

3 

o 

uj  a  x 

r  a 

z  HI 

NOO 

*■* 

Kl 

U 

u.  a  a 

CL  V> 

a 

oj 

LO 

o 

oo  X 

—  o 

—  O' 

«  * 

0-9 

eg 

v_JO  O  U  O 

U 

_ 

___ 

_ 

_ 

O 

_ 

_ 

U  O 

_ 

_ 

u  u  o 

_ 

O 

u 

_ 

u> 

u 

o 

O 

IC 

IC 

IC 

IC 

IC 

IC 

IC- 

_ 

_ 

o 

o 

• 

1 

ft 

« 

* 

CO 

1 

UJ 

Z 

1 

ft 

Z 

<  v> 

ft 

ft- 

or  — 

1 

ft 

o 

» 

ft- 

• 

UJ  < 

» 

ft 

•» 

1 

l 

ft 

UJ 

rO 

k-  CO 

1 

* 

-J 

u 

— 

• 

t 

o 

u-  VJ 

• 

1 

1 

ft 

» 

z 

e%  > 

3  < 

o 

• 

$ 

1 

» 

» 

ft 

> 

< 

X  l 

3 

z 

k- 

1 

t 

99 

—  eg 

Z 

3 

» 

ft 

VC 

> 

<t  UJ 

•sj 

CL 

ft 

k— 

a 

z 

I 

ft 

z  •» 

CD  ft- 

X 

X 

1 

ft 

-J 

< 

1 

« 

< 

eg 

O  u. 

k- 

• 

1 

ft 

o 

•  Mi 

<  o 

1 

* 

MM 

MM  > 

3 

< 

UJ 

\ 

ft 

z 

eg 

UJ 

wO 

H 

1 

t 

CJ 

>-  * 

UI  to 

< 

<* 

ft 

V 

• 

o 

ft 

Z 

•  — T 

Z  <7 

T. 

M 

1 

Q 

ft 

001 

< 

eg  x 

ft-  uj 

a 

1 

eg 

ft 

*4 

o 

> 

► 

— 

ft 

—  eg 

Z 

v> 

X 

»  _ 

< 

1 

-s. 

< 

ft 

• 

lii  o 

X 

•  MM 

o 

m  rO 

to 

t 

ft 

X 

£ 

UJ 

— « 

X  * 

» 

O 

1 

— 

* 

ft 

IU 

l/l 

*—  ii 

>•  UJ 

k— 

_4 

z 

« 

VI  — * 

*  ? 
<n  *- 

CO 

»  II 

V! 

* 

ai 

>  eg 

■> 

-*  3 

ft 

M 

o 

8 

<J 

T 

Z 

CL 

1 

3 

X 

• 

Z 

a 

ft 

CC 

0-*  X 

O 

n  * 

M 

3 

— 

<x 

1 

r 

ft 

* 

tft  « 

»i  Z 

c 

k- 

o 

Ui 

ft 

O' 

t: 

a 

< 

i 

X  3 

H- 

k- 

ft 

k- 

a 

♦ 

ft 

■-i 

•  •— 

OJ 

► 

eg 

» 

i-» 

ft 

•-• 

Z  «  3 

>  o 

• 

w 

♦ 

Ik 

►» 

a 

• 

0* 

►— 

ft 

a 

a 

a 

< 

a 

00 

0- 

z 

3 

a 

a 

to 

<r 

OC 

9* 

z 

* 

a 

i 

a 

tu 

to  r 

3  3 

X 

k- 

'JU 

• 

y 

z 

UJ 

3 

3  r 

r 

< 

z  u. 

u. 

eg 

X 

o 

X 

• 

— • 

• 

UJ  Ui 

< 

O  X 

* 

—• 

* 

* 

UJ 

• 

.7 

X 

o  o 

< 

> 

X 

z 

— 

k— 

t- 

UJ 

UJ  — 

X 

Z 

— 

99 

V— 

ft 

U' 

V* 

>- 

o 

k- 

X 

k-  3 

C 

o 

tu 

-J 

k-  LJ 

►  cO 

3 

► 

£ 

z 

o 

• 

H 

H 

k» 

Z  — 

o 

• 

< 

«i 

M 

ft 

a  Z 

►4 

_ 

M 

•99 

♦ 

UJ 

* 

ft 

u» 

z 

UJ  •— 

x  a 

» 

n 

Ji 

~ 

UJ 

>r 

w 

n 

rt 

•» 

Ui 

ft 

X 

UJ 

• 

k- 

UI 

— 

c/> 

k-  f 

-J 

1 

H  ►— 

3 

99 

3  a 

< 

Ui 

Of. 

OO  y 

eg 

'M 

k- 

UJ 

— 

1 

3 

CL 

• 

3 

k- 

C. 

—*  LL 

3 

a 

* 

9 

z 

3 

ft 

ft- 

a 

iu  c> 

K- 

w- 

► 

3 

u 

u- 

<r 

1 

r 

*-• 

II 

— 

3 

— 

7 

z 

3 

a 

o  UI 

Z 

•—  II 

r 

— 

3 

*** 

Z 

7 

ft 

3 

3 

X  v 

3 

v— 

i_. 

eg 

k- 

— 

V 

r 

to 

—  o 

• 

e» 

uJ 

*«• 

w 

k- 

— 

5 

• 

O 

X 

ft- 

II  VO 

k- 

3 

Ui 

i 

1 

H 

«-• 

9* 

k- 

«t 

< 

z 

k- 

3 

a 

7 

_J  *li 

-J 

M 

— *  99 

H* 

< 

< 

z 

k- 

ft 

a 

w 

<_• 

k- 

k- 

“2, 

3 

•I 

a  ►- 

a 

n 

3 

k*  3 

i> 

z 

Ui 

k—  MM 

3 

Ui 

Ui 

7 

7 

Ui 

— 

C 

a 

ti 

UJ 

•» 

3 

<7 

J  z 

UJ 

“i 

o  — 

o 

O  O 

UJ  z 

ft 

3 

3 

t 

X 

‘V  > 

\A 

tf 

"* 

< 

1 

z 

o 

o 

a 

UJ 

V* 

k- 

M  — 

Y 

Z  3 

u 

O.  Ui 

ft 

O  Ci 

-J  ► 

1 

ft 

1 

1 

1 

eg 

eg 

« 

ft 

• 

c 

f: 


U  U  U  o  ' J  U 


182 


1 

J 

* 

: 

1 

1 

UJ  UJ 

1 

*  -J 

« 

o 

1 

Q  Z 

1 

Z 

z  < 

1 

t 

• 

u 

«X  — 

i 

1 

♦ 

M 

♦  cc 

1 

0m, 

—  •*  5-  UJ 

1 

< 

-4  ft-  X 

z 

1  z  X*- 

1 

X 

* 

a: 

—  UJ  o 

I 

« 

o 

3  cO  <  iu 

1 

» 

< 

« 

-J  UJ  UJ  CO 

mm* 

• 

i/i 

cc  3 

1 

< 

CO 

JC 

z 

<  a  cc 

1 

X 

O 

« 

c  o 

» 

OC 

« 

OC 

-  a  il  ►- 

X  1 

»- 

<r 

»- 

ft-  ac 

< 

i  <  z  Q 

o  t 

* 

z 

« 

i 

z 

tt 

3 

«-4 

f< 

* 

-j 

z 

~X  1 

w 

• 

X 

3 

“0 

* 

I 

s 

z  <  —  <** 

X 

y- 

1- 

». 

b- 

<« 

•* 

>  UJ  X  Cl 

*  1 

< 

WO 

X 

wO 

oc 

o  3  z  o 

» 

Z 

o 

*_ 

# 

a 

o 

•*4  1 

• 

9 

y- 

c 

CO 

« 

• 

•— 

O  —  CL.  —• 

1 

D 

« 

o 

i|i  ii  cO  ii  i  ft- 

_J 

CO 

* 

c 

ft 

00, 

» 

o 

O  U.  Z  33  O 

X  C 

» 

z 

ft 

X 

z 

<  UJ  <  UJ 

v» 

X 

»- 

ft 

u. 

JC  K  Q  1/1 
Q.  u  -j 

o  » 

wo 

a 

» 

M 

— 

*■ 

8 

l n 

ft 

* 

X 

a. 

X 

oc 

r 

ft 

z 

1- 

z 

T 

1— 

U/  3Z 

X 

1— 

—* 

> 

ft 

M 

a:  U  «*  C 

—  <  1 

•> 

o 

< 

•-< 

ft 

X 

t 

c 

VI 

—  «  X 

z 

•-* 

z 

<» 

ft 

•z 

UJ 

i^»- 

z 

z 

T. 

• 

ft 

M 

z 

>-«  <  1-  IU 

M  i 

UJ 

rvj 

ft 

c 

rg 

-J 

«.  UJ  cc 

*  •*  1 

«. 

X 

3 

X 

• 

Q 

-3 

ft 

> 

ft 

44 

i 

z 

a. 

<0^7- 

a.  1 

r 

X 

►4 

o 

y- 

OC 

—4 

• 

* 

ft 

V 

X. 

oc 

0-» 

t 

<1 

<  3  “  (L 

UJ  *  | 

< 

U. 

•4 

ft 

V- 

UJ 

< 

♦  o  ^  *-  o 

OH  -  1 

o 

O 

►4 

K- 

co 

o 

r.  < 

-4 

z 

W 

* 

< 

O 

T.  * 

« 

V- 

uj  o  3*0 

uj  —  —  co  | 

z 

►4 

Z 

W 

UJ 

UJ 

»-  z 

< 

ft 

X 

UJ 

—  X 

i 

UJ 

UJ 

*  *Jo  UJ 

►-  O  Q  CL  1 

w 

^0 

UJ 

c-  O 

» 

cO 

•k 

ft 

►41 

u. 

►- 

O  — 

z 

-W7rt  Q 

z  —  — '  1 

•> 

UJ 

Cl 

> 

z 

*-4 

1- 

Z  * 

M 

ft 

z 

—  o 

i 

ai 

> 

CO 

•4 

-J 

UJ 

1— 

CO 

O  ^4 

UJ 

►- 

■* 

►» 

O  >•  CO  UJ 

^  O  1 

z 

C 

z 

z 

V 

II 

-4  UJ 

ft 

Z 

^  4- 

i 

3 

*  <  a:  —  D 

*-*-!#  1 

C  uj 

< 

H 

V)  UJ 

►- 

# 

O 

v>  ft 

->  3 

UJ 

ft 

V- 

»- 

ft  < 

i 

o 

<  —  a;  X  a 

—  a.  a  i 

v>  3 

cc 

L- 

CO 

cc 

X 

*  Z 

3 

ft 

V- 

3 

*-4 

CC 

i 

3 

cc 

sc  <  v-  — 

o  UJ  CD  Ul  1 

z 

i- 

5C 

w  Z 

3 

z 

U  CJ  © 

<  — 

Z 

z 

ft 

3 

O  UJ  CO 

c 

O  <  o  z 

~  o  r-  o  1 

_J  •• 

-j  *“• 

<x 

C3 

o 

o 

o 

<M 

a 

ft 

O 

«  ft 

i 

0 

a : 

VO 

►*  >  Z  X 

jUJ  JUJ  | 

—* 

— J  ►“ 

rci 

<  »- 

3 

oc 

-J  UJ 

<  z 

♦— 

3 

ft 

oc 

O 

UJ  mi 

t 

c  • 

CO 

— • 

IL'OJU 

o.  >-  <  W  | 

<  z 

o  z 

►*»  O 

C3 

O 

Cm.  y- 

< 

8 

o 

7 

O 

® 

oc 

0- 

y-  < 

I 

Z# 

3 

X 

3  u-  oc  X  uj 

rz«jz  i 

o 

u  o 

< 

n 

o 

UJ  z 

3 

O 

X  Z  uj 

U.  C 

u» 

O 

UJ 

z 

ft 

3 

UJ 

r. 

Z  uj 

i 

UJ* 

CO 

ft- 

-  —  a  -*  l 

C3 

o 

o 

o 

u 

ex  uj 

•✓) 

»-4 

CC 

—  o 

O 

CX  WJ 

ft 

co 

r>j 

•"4 

—  ac 

I 

l 

» 

i 

l 

ft 

» 

1 

M 

rsj 

ft 

i 

I 

ft 

I 

1 

ft 

1 

• 

ft 

u> 

1C 

IC 

IC 

IC 

IC 

IC 

IC 

IC 

v_>  O 

u 

a 

CJ 

o  o  o 

u  o 

u» 

u 

o 

o  oo 

•m} 

*  —  ~  I  I 


11  H  II  M 


H* 

o 


o 

o 


>■ 

o 


<  X  > 
U.  O  O 


U.  X 

o  * 


• 

*  0 c 

•  -a 


»  * 
ft  n 

t  — 


'  —  3 

-  —  r 

'  >  i- 
V  Z 

c 


i-  h-  oc 
<  <  «r 
cc 

a.  Q  > 
i'  < 
IUH« 

O  <  nc 
3  < 
mJ 

*-  <  UJ 
•—  >  I 

uj  ►- 

>- 

cl  z 

aj  >  ~ 
O  — 

X  cr  O 

UJ  CU  UJ 

3  O 
-«  < 


uj  a  uj 
Ou.cc 
o  «r 
C  - 
►-  n  «/> 
c  ▼ 


Z  X  o 
—  *  <« 
«/>  u.  >- 
1-  < 
v  “ 

~  X  _j 


«  a 
x  > 

U  UJ 

-J  z 


>  *  ~  -4. 

I  W 

o  a  •— 

o  a 
X  co  ~4  o 
i  Z  —  ~ 
<  a, 
>-*»«► 
a 

oc  t  o 

»  to  *z 
-<  *  '3 

«X  t» 

Jj  2  JiC 
SC  —  UJ 

.2 

a.  co 

z  *-  X  - 
1  Z  a.  Z 


i/>  ►  T. 
Nl-  o 


3  *V  <t  cj 
u-  3  -j  Z 
H  UJ 

Z  X  *  * 

•—  —  i/>  vo 

u.  a  ft. 
»-  'x  _/  Z 


-j  r 
a  r 
X  *3 


»  X 
>-  JC 
O 


O  w 
'Nj  < 


ft-  < 
Z  uj 
—  a 


>  l  ♦ 

*  o  ~ 


»  H  W 
t  *  * 


x  *  > 


— •  x  : 

z  ' 


X  > 

.  c  o 


o 

< 

u. 

N. 


u  > 
<  a 


«  7 

*«.  # 
X  M 

a 

w  ♦ 

*-  x 
o  o 


x  ~  3 

r»  o  •—  7 

—  >  -  -* 


1-0  3  —  1-  <  7* 


185 


*  to 

*  z 

*  o 


: 1 
<  - 


CM  — 

w  O 


o 

o 

3 

O 

O 

o 


Z  —  —  ■—  -  * 


33 
-3 
oo*:  u» 
O 


j  -  3 
*0  2T  WJ 

A.  3  Z  Uj 


C 

3 

c 


o 

o 


- .  aj 

- X 


a.  a.  * 

»  k-  rg 

»  t/)  M  W  "< 

*  k-  a.  a.  a: 

o.  X  >  UJ 
I  K  OUi 


r 

o 


X 

o 


«  3 

«  3 

« 

«  * 


•  7 

•  «— 

*  3 


•  O 

*  7 


a.*-  z 

a. 

JJCWfi 

C 

3 

2 

0 

3 

N 

to  2 

“  *  »  *  3  z 

a 

z 

D  1 

O  UJ 

X 

4  uj  a  £  3  < 

1 

O  z 

— 

O 

Z 

Z 

N  3  — 

3  aj  3  aj  2  3 

a. 

O  1 

0  Z 

7.  X  0  cc  -  • 

—  0 

0  *- 

ro  0 

O 

3 

0 

a  w 

-j 

D 

C£  f 

Z  — 

z 

K  U  L»  UJ  w  O 

-J  x 

O  2* 

3 

O 

Z 

3 

j  **  z 

3  *  *  O  Z 

3  3 

fT  | 

V-  u>  u_ 

UJ 

u.»  H  V>t  <  -U 

CL  x 

U 

u. 

C 

O 

u. 

c 

O 

8  S 

IU2 

3  1 

ID  X  UJ 

X  vt  Z  3  3  >  10 

X  0 

O 

■** 

O 

O 

—* 

c 

0 

0 

a  uj 

to  | 

1 

-J  •-  c 

3 

»-  —  *•  T.  Z.  <  3 

3  0 

rg  3 

kA 

*7 

fO 

0 

>0 

O  f» 

I 

1 

AI 

1 

T 

<  /  ~ 
c  *- 


_ 

» 

1 

» 

_J 

1 

1 

< 

1 

1 

* 

0 

< 

1 

NJ 

1 

1 

1 

# 

t- 

1 

1 

1 

1 

• 

y 

1 

1 

aj 

1 

1 

» 

tO 

3 

• 

O  to 

1 

rg 

1 

1 

* 

X 

z 

1 

1 

£ 

1 

1 

• 

«■* 

« 

> 

1 

ft 

a 

1 

1 

1 

* 

AJ 

Ud 

-j 

1 

X 

> 

1 

1 

1 

k- 

u 

1 

UJ 

UJ 

1 

l 

t 

• 

» 

X 

1 

z 

a 

• 

AJ 

1 

i 

• 

to 

_J 

1 

a. 

1 

tJ 

to 

1 

1 

* 

k- 

< 

UJ 

1 

X 

1 

z 

1 

1 

* 

a 

M 

X 

1 

3  « 

a. 

1 

• 

X 

1 

AJ 

1 

• 

AJ 

X 

1 

u 

3 

tO 

1 

rg 

3 

1 

►- 

1 

# 

tJ 

1 

£ 

z 

1 

3 

-J 

1 

1 

* 

z 

UJ 

1 

* 

M 

1 

£ 

O 

1 

• 

1 

* 

u 

* 

c 

Aj 

1 

«» 

* 

» 

1 

0 

1 

1 

# 

X 

1 

> 

• 

«■» 

1 

• 

1 

►- 

1 

• 

a 

1 

1 

u> 

m 

ro 

1 

3 

UJ 

1 

1 

• 

UJ 

UJ 

UJ 

» 

C  3 

» 

«■* 

1 

,  , 

X 

1 

• 

1 

« 

CL 

x 

k- 

1 

0 

a 

1 

X 

3 

1 

1 

a 

>- 

►— 

t- 

1 

r 

to 

^4 

0 

1 

1 

f 

» 

* 

<f 

1 

1 

w 

t 

» 

3 

• 

» 

1 

« 

to 

a 

1 

< 

a 

3 

1 

Z 

3 

1 

i 

* 

UJ 

4 

1 

tO 

3 

1 

1 

a 

1 

* 

A 

1 

a. 

UJ 

1 

» 

£ 

w 

1 

M 

» 

AJ 

M 

AJ  AJ 

* 

to 

< 

u 

1 

£ 

3 

a. 

1 

A) 

1 

O 

1 

V- 

k- 

• 

a 

- J 

1 

ft 

tO 

>- 

1 

* 

Z 

a 

< 

1 

» 

UJ 

3 

1 

-J 

u: 

to 

| 

k- 

1 

O 

3 

1 

1 

« 

« 

• 

• 

ft 

cc 

1 

< 

M 

i 

M 

1 

• 

M 

< 

1 

< 

1 

* 

A 

1 

UJ 

z 

u> 

J3 

<■* 

1 

3 

3 

7 

1 

1 

*-• 

AJ 

AJ 

w 

X 

r 

O 

1 

cr 

a- 

UJ 

► 

1 

x 

4 

t 

« 

1 

AJ 

Aj 

aj  a* 

w 

u 

z 

1 

* 

0 

1 

r 

UJ 

I 

AJ 

1 

OS 

<£ 

£  C 

k* 

Q 

1 

>0 

•-4 

A. 

3 

1 

«” 

r 

1 

AJ 

» 

ft 

to 

1 

* 

to 

1 

1 

3 

3 

1 

Z 

£ 

1 

♦ 

♦ 

♦ 

ft 

M 

z 

3 

1 

z 

*— 

X 

• 

•> 

A“ 

3  X 

1 

_ 

ft 

l 

1 

1 

ft 

1 

» 

(X 

rr 

3 

3 

-*  UJ 

1 

3 

u. 

l 

1 

— < 

• 

X 

0 

UJ 

1 

1— • 

3 

**  n 

I 

z 

3 

5 

1 

►- 

k- 

ft 

r 

a 

1 

» 

*■ 

M 

*• 

A.  Z 

1 

fc- 

_ 

i 

A. 

1 

ft 

— 

X 

4 

• 

*/■ 

» 

7 

to 

> 

X  M 

1 

y 

Aj 

1 

*g 

£ 

1 

• 

» 

• 

» 

ft 

3 

—• 

i 

AJ 

k- 

tJ 

k- 

1 

\ 

X 

1 

• 

1 

ft 

a 

u- 

1 

ft 

O 

3 

a 

3 

1 

<0 

1 

X 

1 

AJ 

A* 

ft 

k» 

“T 

1 

O 

Z 

AJ 

AJ  -• 

5 

A' 

1 

AJ 

1 

ft 

z 

«4 

1 

1 

ui 

k—  3 

1 

4 

i 

« 

6 

r 

— 

» 

AJ  33 

£ 

x: 

*< 

CD 

£ 

ft 

u. 

X 

to 

3 

• 

t* 

< 

3 

X 

* 

O 

w  w 

1 

/V 

X 

1 

u. 

•— 

£ 

1  z 

* 

Z 

» 

•• 

ft 

O 

CC 

C 

1 

UJ 

c. 

->  z 

AJ 

to 

1 

3 

1 

>- 

0 

1 

J* 

H 

Aj 

M 

H 

ft 

0 

UJ 

UJ 

z 

1 

UJ 

0 

a 

1 

AJ 

1 

• 

I 

* 

* 

» 

X 

« 

X 

k— 

«r 

1 

z 

X 

** 

3  — 

1 

Ui 

X 

1 

—• 

— * 

■— 

w* 

» 

ft 

UJ 

>- 

UJ 

•  « 

1 

— 

•— 

vo 

»/> 

■c 

A. 

0 

< 

1 

— • 

» 

t 

4 

— 

A# 

< 

4 

> 

z 

J 

a 

1 

wL 

Cl 

0 

'T 

— 

— 

*  AJ 

1 

3 

T 

l/V 

t 

L- 

• 

<r 

l  u 

y 

y 

U-  N 

* 

»u 

ft 

u 

< 

>- 

1 

k» 

-A 

3  Z 

ft 

ft 

X 

UJ  ft 

» 

3 

3 

X 

1 

— 

a 

1 

u 

il 

3 

n 

M 

A. 

3 

ft 

k- 

L- 

a 

1 

— 

a 

X 

3  CL 

1 

"■ 

r* 

1 

'  J 

u 

X 

•  V 

V 

X 

Z  V 

V 

V 

z 

Z 

<1 

3 

«J 

*1 

w 

1 

3 

z 

UJ 

U.' 

4" 

<  U* 

1 

■\ 

V 

c 

• 

1 

Aj 

w 

Aj 

M 

•X 

rr 

« 

r 

a. 

X 

1 

n 

O 

3 

ft 

>  O 

1 

a 

1 

j 

UJ 

j 

u- 

k~ 

4 

< 

k-  AJ 

V- 

►* 

< 

k- 

3 

ft 

u 

U> 

• 

X 

uj  a 

3 

—  tu 

1 

A. 

3 

• 

>L 

►- 

4 

> 

z 

2" 

L- 

C 

V 

“ 

/ 

UJ 

» 

a 

> 

A 

> 

<1 

3  »- 

f 

3 

a. 

1 

5. 

z 

1  52 

0  3 

c 

UJ 

z 

ft 

3 

u. 

X 

1 

r. 

3 

z 

O  UJ  0  T 

1 

10 

4 

< 

1 

X 

»  c 

0  0 

0 

uf 

UJ 

ft 

1/ 

0 

r 

1 

u 

O 

u 

u.  — 

1 

| 

l 

1 

ft 

1 

ft 

» 

( 

1 

» 

—* 

AI 

ft 

1 

1 

t 

1 

ft 

» 

1 

1 

1 

ft 

1 

1 

1 

1 

ft 

1 

1 

u 

c 

c 

O 

0  0 

u 

O 

Uj 

C 

C 

C 

c 

c 

0 

O 

iu  to 

V  Z  « 


'  a.  <x  L 

x  ai  ui 
—  O  C>  Jt  _i  3 

a  o  z  <  _» 

3  «<  1  •  - 


—  “*  to  to 
u  cl  a. 
>■  -J  z 


oljv  0000000000000 


H  -> 

I 

>  rvj 


13U 


f; 


|  M.  | 

3  UJ 

1  AJ  | 

X 

U. 

uj  ro  £ 

ri 

X 

O 

x  0 

1 

UJ 

3  0 

1  0  VO  ■ 

s  ; 

3 

3 

>  3  ui 

1  3  3  1 

i 

< 

Z  •  00 

1  •  a  l 

2 . 

x  a 

CC 

3  •  cc  UJ 

1  x  >  * 

3  z 

O 

•  <  >  0 

1  Uj  UJ  3  1 

1  < 

UJ 

to  •  O  <  3 

1  3  Z  Ol 

z 

3 

Z  ♦  t 

1  a  3  1 

0 

z 

cc  C'  to 

1  X  »  v.  | 

*  ’A 

OJ 

—• 

•  UJ  M  3  C  3 

1  0  0  a  1 

{ 

X  U. 

UJ  3  ,1 

t  0  x  tO  | 

*—  3 

ID 

3  —  to  ro  tj 

1  co  z  1 

• 

17  0  -0  3  rr  z 

4 , 

r  iu  cr 

3 

Z  3  A  3  s  3 

k-  > 

<  3  M  UI  u. 

1  >  ••  •  1 

—  *-  u. 

«/k 

3  <  3  y  3 

1  1  won  1 

►*  C 

M 

a  a  X  u.  Z  Z 

ICO  *  ~  1 

< 

3  O  3  UJ  3 

1  *  3  c*  a  1 

•  ■» 

->  *U 

X 

<  A-  <  5. 

1  x  ^  3  i 

t 

3  > 

3  D  z  3  D  2: 

1  1  Z  3  3  * 

X  3 

ID 

<  Oaj^oC 

l  <  a  1 

! 

ug  ►” 

CL 

u  y  cr  3 

|  ^  •  to  •  | 

1  i 

C.  < 

UJ 

-u.wit<k* 

1  3  3  a  1 

•  > 

Z  r  *-  C  u 

1  X  g  : 

1  •  to  •  2  1 

•  X  **• 

5 

U  3  4  (x  «. 

•  3  ^ 

Z  u  Ud  3 

1  j  u;  0  1 

1  UJ 

<  U.  3  X  U. 

O  C,  2  3 

1  <  3  3  •  1 

-  • 

•  >  C 

X 

1  j)  z  an*  1 

i 

(3 

UJ  Uj  O  X 

ui>i5zu 

1  at  aji  1 

f  ‘ 

>31 

• 

1  A  O  | 

»  3 

X  Z  3  0  <  3 

|  •  tO  •  3  j 

i  ^ 

A  X  1 

3 

>  <  uj  X 

|  —  Cl  to  | 

>  c  >• 

to 

•  to  j; 

I  Z  3  x  *  1 

X  3  o 

Z 

CL  U.  Uj  • 

1  1  Z  CC  z  1 

”> 

O 

U>C  J4ZUI 

*  u>  ) 

>  tt.  O  3 

X  X 

to 

c  to  <J  >.  3  < 

•  to  •  X  1 

i 

X  k-  c. 

UJ 

3  3  <a  3  0 

|  AI  3  0  1 

j 

■> 

3  3 

Za  <*“• 

1  *  a  3  *  » 

i 

-  to 

< 

V»W4ttf  O 

">ujOz 

1  fx  z  7  | 

l 

>•  <  u. 

OL 

i  IL  »  U~  | 

UJ 

ft  O  O  ID  3 

1  7N<U  1 

a  Ci  3 

z 

3  OJ  <  3 

1  UJ  C  3  Z  1 

X 

UJ 

a'kUjZC 

1  3  Uj  1 

L 


1.. 


I 


i 

>- 

o 

> 

0- 

< 

H- 

cc 


-J  A* 
<  -* 


-o 
sc  o 


a 

sc 

5 

-j 

• 

X 

UJ 

I 


o 

►- 

o 


o 

& 


3 

cc 


AM  *  A 


o 

aC 


AM 

L.  O 

o  • 

VS 

o 

MM 

*4  3 

A 

1 

• 

, 

3  O  vs 

3 

k- 

3 

> 

&  z  -J 

a 

a 

z 

1 

-J  z  > 

—  CD 

—  o 

O 

AM 

a 

T 

X 

1 

z 

• 

<  v» 

Z 

3 

o  > 

ac 

o. 

«I  *» 

UJ 

O 

o 

« 

I 

1 

MM 

o  — 

c 

-J  ♦ 

OL  — 

X 

X 

z 

2C  IU  *"* 

• 

o 

O 

X 

1 

MU 

Z  X  UJ 

—  UJ 

Uj  a. 

UJ 

• 

O  3 

a  — 

UJ 

u. 

1 

—  X 

k- 

2  —  vs  _j 

A* 

O 

O' 

A 

A  A 

•»  H  sc  -> 

a 

Ci 

3 

1 

as 

V)  k- 

< 

—  cn  ♦  uj 

n 

2 

UJ 

—  H 

a. 

•>4  M 

a. 

z 

• 

1 

M 

3  uj 

2 

X 

►  w  <  K 

•  ac 

< 

• 

VS 

sc 

Cl  am 

—  N  * 

• 

z 

z 

1 

* 

X  vs 

— 

ZOsCVSOUl  —  « 

O  U, 

X 

a  — 

—  — 

X  VS 

—  x  a. 

* 

• 

— 

( 

CC 

— 

CD  k-  UJ 

C>  {i.  kj  vs 

‘O  (I  “  M  M  >  • 

•  ♦  VI  N  -UJ  {* 

O  —  N  UJ  w  0.300 
*4  OX?**! 

•  Q.  «  u.  —  C  am  C 


>  — 
—  ^ 
a  -* 
x  vs 
O  a 


■  a.  vs  (x  ■O  Doj 

x  a  N  Si  M  K  II  “•'*  j3 


3 

< 

Z  X  *“• 

—  vs*-ec  —  >-•»»- 

MM 

- -  C  HhuJjS 

►—  3 

Z 

o 

Z  vs 

3 

m  vs  vs  >  •-* 

z  z 

-0 

O  '1  UJ 

O 

U 

x  cl  a  —  cl  Q 

c  — 

k- 

O  —  k- 

z 

—  x  a  x  ° 

o »- 

Z 

3 

*< 

z 

o 

< 

X  o 

o 

o 

O 

z  o 

o 

z 

ac 

3 


—  O 
O  ac 
O  u. 


o  — 
-HZ 
ui  <  tr 
*-  X  3 

*-«K 

aC“> 

j  u.ti 


—  o 

"*  cc 
o  a. 


iu  <  ac 
k-  x  3 
—  a  *- 


*  > 

*  _J 

*  o 

*  vs 


*  3 

*  O 

*  « 


J  ar.o  *  cs 
ju-«  2  •  3 
'u  •  vs 


orr- 

uj  »*  vs 
»-  O  O  O. 


*02 
*“  #  J  O 

—  a  — 
OujO  vi 
-0*2 
Jiu  JUI 

Q.s-<y 
T.  Z  ai  — 

—  -  JC  c 


cc 
a.  a 
o  >- 

O  — J 
UJ 

o  r 

UJ 
UJ  k- 

z« 

~  > 
k-  v> 

3  « 

DC C  > 

a  < 

as  uj  » 
3Z 

VS  am  X 

-I  * 

r  3 

OX 
CC  <  X 
a.  -mu 


h  i/i  j 
UJ  o 
X  >  vs 
•»  J 

<  o  z 

i/)  w 

v>  X 
uj  ►—  v- 

>  vs  _ 

j  ct  Q 

a  ~  z 
vs  u.  < 


— — — 

*— 

1 

— 

— 

— 

— 

*— 

— 

““ 

— 

— 

— 

— 

~ 

—  * 

-  —  —  — 

— 

— 

— 

— 

—— 

— 

~ - 

— 

— 

— 

— 

~ 

“  ~ 

— 

— 

— 

1 

• 

< 

1 

* 

X 

» 

k- 

• 

o 

1 

* 

AJ 

cc 

1 

c 

cs. 

UJ 

1 

* 

V) 

UJ 

2 

►4 

1 

« 

QC 

U 

1 

Ik 

3 

< 

UJ 

1 

* 

c 

2. 

1 

o 

* 

uj 

3 

ac 

1 

< 

* 

~J 

u. 

Z 

1 

u. 

# 

-J 

o 

3 

A 

* 

1 

Q 

1 

O 

MA 

1 

« 

1 

QC 

Z 

—J 

1 

CC 

• 

< 

« 

_J 

> 

< 

* 

AJ 

-J 

< 

•—  » 

3 

-J 

* 

1 

3 

< 

o 

Q  k- 

A 

a 

# 

3 

O 

X  u 

M 

1 

>• 

AJ 

* 

z 

• 

LL 

UJ 

o  > 

AM 

* 

#• 

< 

QC 

r 

3 

t 

k- 

» 

« 

3 

— 

M 

o 

k- 

Q- 

-i 

> 

o 

X 

* 

-J 

i 

cr 

k- 

3 

A 

1 

•< 

* 

< 

k- 

o 

— i 

z 

M  «■ 

vs 

• 

o 

1 

UJ 

X  U 

as 

<M 

» 

* 

< 

u 

> 

>• 

Z  AJ 

< 

o 

a 

kA  1 

—  n 

A< 

» 

* 

CO 

k— 

(C 

X 

t-t 

vs 

o 

cC 

O 

lA 

lA  1 

o  u 

-J 

* 

* 

* 

z 

k— 

M 

»  VI 

UJ 

k- 

3 

* 

VI 

-J 

3 

z 

CC 

.J 

p 

*■>  i 

►  • 

** 

k- 

X 

AJ 

a 

< 

u. 

k— 

CL  a 

X 

CL 

O 

3 

X 

>»  V 

Ol 

* 

UJ 

•A 

■< 

M 

k- 

< 

*  O 

or 

UJ 

O 

H  ~k 

o 

c 

X 

• 

* 

cc 

a. 

?. 

z 

AJ 

>- 

►  i 

* 

X 

a  £D 

«A 

X 

— 

a 

» 

VS 

* 

z 

* 

3 

CC 

UJ 

a 

3 

"S  o 

3J 

cr 

«k 

IA  | 

X  X 

o 

a 

a 

««. 

r 

X 

C 

:c 

y 

x  * 

z 

X 

k~ 

cC 

-* 

L*S 

us  i 

—  3 

o 

o 

< 

~j 

» 

• 

z 

»-« 

ac 

QC 

< 

X 

< 

A 

AM 

AM 

4. 

a 

lA 

—  1 

««• 

8 

A 

_ 

AJ  U. 

w 

•»k 

z 

1 

o 

<? 

LJ 

vs 

•  LU 

AM 

M* 

3 

*4 

■X 

X 

—  i 

“S 

“S 

3 

til 

«_ > 

-> 

a 

A 

AM 

>- 

X  >  >“ 

V- 

o 

• 

w 

w 

* 

« 

» 

c 

w 

Z 

c. 

UJ 

M  A 

2 

«t 

< 

• 

o 

» 

VI 

o 

*s 

X  >* 

* 

< 

k- 

-J 

a 

»  i 

z 

■> 

“S  -S  3  -S 

z 

AJ 

• 

Ik 

uJ 

k- 

A  vA 

8 

z 

N 

o 

Cl' 

UJ 

> 

• 

— 

• 

• 

* 

* 

•-4 

* 

z 

V) 

as 

X 

A 

k- 

• 

k- 

A 

*— *  | 

K  « 

c 

« 

o  ♦ 

♦ 

>  k- 

» 

Cl 

•A 

u. 

A 

H 

•f 

IT, 

*  X  X 

z 

r 

* 

UJ 

A 

r-- 

UJ 

-j 

as 

A  VS 

o 

z 

AM 

mM 

X 

r 

y.  —  - 

vA 

X  O  C 

• 

*•« 

M 

♦ 

* 

O 

O  X  X 

w 

X 

VS 

-j 

->  UJ 

AM 

X 

(X 

r  i 

z 

o 

••  o 

X  > 

M 

o 

# 

O 

UJ 

*• 

v> 

k* 

a 

o 

-J 

Ci 

A 

R 

>1 

R 

• 

a 

Ul 

or 

2 

AM 

On  x 

X 

o  w  w 

— 

Uj 

»- 

k- 

k- 

3 

* 

UJ 

k-  O  O  0. 

•— 

k- 

•  < 

Ul 

— 

AM 

a 

AM 

•— 

O  i 

C. 

* 

c 

z 

— 

Ul 

—  o 

1 

AJ 

A 

n 

*»• 

Z 

a 

Z 

—  i 

—  >  > 

>-< 

X 

> 

H 

k- 

X 

z 

* 

*-• 

VS 

• 

< 

o 

vs 

“ 

o 

II 

A-« 

:c 

3 

— 

AM 

a 

UJ 

'J  •-  eg 

AJ 

M  I 

»• 

3 

< 

UJ 

3 

2 

3 

< 

UJ 

-J 

AM 

-J 

3 

-J 

z  i 

II 

o  n 

*s 

3 

3 

II 

2 

UJ 

X 

# 

Z 

k- 

-J 

a. 

X 

Ul  QO 

< 

II 

AM 

V 

c 

z 

< 

x  x  a 

2  UJ  UJ 

II 

M 

II 

k— 

o  u. 

a 

w 

3 

k- 

* 

a 

2 

< 

o  ♦ 

— 

a 

o  o 

a 

U.  LU  UJ 

®  t 

*>4  0  0 

•— 

M 

AJ 

k- 

X 

X  k- 

O  2 

* 

k- 

U  uj 

CP 

V 

c 

rc 

-J 

u.  _J 

L- 

o 

V* 

AM 

— 

o 

o 

AM 

vs 

o_,  c 

O 

•-< 

>— 

2 

* 

3 

o 

* 

2 

3 

k~ 

o 

k-  < 

A- 

a 

uj  a  uj 

UJ 

3  ! 

*-* 

X 

V  o  u. 

o 

O 

K 

-l  3 

n 

-J  UJ 

-J 

LU 

Ul 

* 

< 

iu 

2  W 

Z 

UJ 

o 

►-  X  *- 

l— 

k- 

k~ 

►- 

3 

O 

2 

* 

nc 

a 

k— 

> 

Q 

mJ 

—  u. 

Aj 

" 

UJ  | 

<i  <  < 

o 

C 

<  uj  2 

» 

u 

X 

z 

— 

—  o  — 

— 

o  o  o 

r> 

o 

U  a  u 

s 

uC 

O 

1 

» 

vs 

i 

AJ 

- 

»* 

• 

fNJ 

I 

i 

« 

o  t  j  o  o  u  u 


o  u  o 


o  o  o 


185 


Assembly  of  the  Equations  and 


Solution  of  Dirichlet  Problems 


n 

U 


O  Iuj< 
z  i-  u> 
a  i  ^a. 

2  -ui  a 

X  IL  h> 

-  c>  <x 

7  ft  Z  < 

—  <  a 

u.  X 


:  z  c 

■  <  a 

>  of  ftl 


■  a  d. 

C  <f 
•  *-  T 
O  O 


!  o 

U  Ui 

•  _i  X 

'  c 


(/>  a 

ui  ui 
cf  ft 
C'  Q. 
C  < 
O  S 


a:  o  o  <\j 


i  gf"  2  ui 


.J  ►- 

<r.  » 

►*-  2 
—  C  l 


IX  *T  -«  't 


c  * 

.  —  ft 

i  x  u 


't 
Ui 
X  V 
u. 
5  C 


-  ft 
C 

©  cu 


)  Ui  ft 

x  z 

»-  o 

U.  X 
I-  *• 
a 
: x  -x 

c  c 

.  a  o 


ft : 
u. 

>  * 

I  V 


Z 

jC 

U 

z 


•  ut  x 
Ui  »« 

.  ►-  — j 


a  u 
a  u. 
-j  x 


£ 
2  x 


ui  w 
X  < 
X 


—  *-  U>  ~J 


>  z 

a.  ~ 


u*  O  • 
Z  x  a*  < 
z  *- 

c  X  t 
r_>  i_  * 


u  1/1  J  ( 

if  f  -  - 


Z 

Ui 

of 

% 


(ft  — 
Ui  ft. 


U.  -/ 

o 

Ui  b- 
i/'  o 
X  ui 
> 
CL 

n  u* 

—  x 

ui  h» 

>  C 
>- 

I 


o  >- 

§“ 

c  c 

rc  u. 

Z 
u>  O 

ft  x 

x 

X  ** 
p  u> 
CO  < 
<X  It 


CL  O'  of 

o  c  c 

uu.il 


C  O  r 

Z  ft*  Z 


XXX 

u^  U>  Ui 

C  C\  c 


u;  O  . 
X  _j  « 
of  <  : 

►-  > 

o:  3  : 
c 

U.  Of 

C  C 

ft  U.  I 


C  U.  L 
Z  O  C 


UI  X 
CO  X 

■y  < 
z  ~ 

I 


> 

c 


I  Ui  —  —  — 

<n  -<  m  i*' 

-r  ►  *  . 

#  ft  —  —  — 


:  c  c 
•  z  z 


LOC  *  - 

Ir'??  3' 

—  • -  c 


C7  Cl 

Ui  C  ’ 

ml  T  Z 
w  ►-  c2  i 
■"*  u>  ' 
T:  x  i 

c  > 

Ui  a  • 

_i  x  <  . 

~  j  c.  : 

«t  5  ;• 

<c  ■? 
<  >  <C 
> 

*-  >  . 
X  »i  (Z  » 

x  *■-  c : 

>~  Ui  b 
X  — 

»  «-  u  b 

ft  ■  O  1 
O  «U 
—  —  a  i 
<£  X  X)  » 


Ui  Ci 
U. 

r  — 

_>  — i 

y  a 

—  cv 

X  2 
<  1/1 
r 

i/i 


"D  _l 
O 

*  i/i 

c 

Of  ‘ti 

x  X 


►  o 


*  '*  ft| 

OO- 

x  z  a 

CD  z 
—  *03 


(/!■«* 
c  — 

Z  -  rft 

cr  ui  — 
z  a  _j 
o  c* 


v*  i/i  v» 
/  J" 
<  Of 


—  (I  TT  U'  — . 

*K  /uj 
U<  ►“  Z  O  — 
a  ui 
*  -  z  * 
•  c  w  1/1 
~  -  -x  *  ** 


c-rr 
>  r  n 

—  Ul  u) 


•w<-»«->ouoou,oo«-io«-iOUoooooog>owu»ooog>owooo»-iUOUoou>oOLj'i-uouuooo 


>-  <r 
z  z 

UI 

Of  -i 


(O 

r 

«  r-> 

i/'  — i 

C 

c: 

z  z 
o 
u 


Of  ^ 
<  OJ 
a  xi 


— 

— 

— 

— 

— 

— 

“ 

— 

— 

—  “* 

— 

— 

— 

“  ““ 

— 

— 

- 

— 

— 

— 

— 

— 

— 

— 

— 

~ 

— 

— 

— - 

liJ 

“ 

— 

— 

_ 

— 

— 

MM-M 

c 

c 

u> 

z 

c 

tt 

>- 

c 

• 

UI 

# 

-J 

• 

u. 

u- 

« 

Ui 

U. 

Ui 

u 

X 

•J- 

UJ 

b- 

b- 

C 

UI 

b- 

3- 

ft 

* 

u* 

u. 

ZJ 

C 

.J 

Ui 

u 

b- 

1 

Ui 

O 

ft 

« 

u* 

i 

u« 

b-  C 

z  < 

UJ 

* 

X 

ft 

p 

Cf 

»£ 

T 

XI  Z 

»• 

> 

b- 

ft 

u; 

X  • 

b— 

X 

X 

< 

ft 

t 

or 

g 

b— 

UI 

a 

o 

< 

X 

U' 

i 

0l 

< 

X 

• 

3 

tc 

a 

Ui 

—  I 

a 

• 

o 

X 

ft 

Ul 

b- 

z 

u 

ft 

cf 

X 

t 

■X 

XI 

«! 

U.  b- 

c 

*r 

l 

< 

X 

b-  b- 

1- 

b— 

< 

* 

z 

X 

«.J 

1 

X 

V 

lu 

•—  X 

3 

u 

‘t 

• 

Ui 

Ui 

< 

XI 

X 

Ui  X 

V 

Ui 

u 

UJ 

z 

n 

V 

a 

ft 

UJ 

* 

-J 

x 

UJ 

—* 

u 

1  • 

X 

*— 

cr 

i— i 

U 

z 

Mi 

b* 

w  VI 

O 

UJ 

<1 

— 

Cl 

ft 

Ui 

-4 

b~ 

>. 

V- 

X 

Ui 

b- 

u 

b- 

i 

z 

b- 

—  CV 

e  a 

Mi 

<? 

> 

z 

*■> 

< 

b- 

—  / 

b- 

<1 

z 

o 

<* 

, 

z 

Ui 

3 

_• 

JI  C 

U.  b- 

X  O 

u> 

ff 

u 

Ui 

c* 

> 

z 

^  «t 

< 

a 

a 

MM 

b- 

a 

ft 

» 

x> 

CO 

-J 

Ci 

c 

u 

ft 

< 

— 

•* 

Cl 

ft 

c 

u» 

•Mi 

UJ 

b- 

1 

ft* 

# 

of 

o 

-f 

z 

Xs  _ 

u. 

ft. 

u- 

<* 

< 

ft 

> 

>-  ,* 

UJ 

i. 

O 

ft 

MM 

or 

« 

'J 

b- 

XI 

> 

'J 

u 

Z  V- 

b- 

Ci 

LU 

b— 

U 

•u 

ft  MM 

ft 

UJ 

b- 

L* 

Ui 

ft 

• 

- 

z 

o  — 

z 

•Mi 

ft. 

Of 

_i 

I”  «J 

X 

O 

ar 

b— 

* 

XI 

—  X 

X 

Ui 

c 

X 

ft 

■* 

a 

C 

UJ 

* 

1 

* 

z 

> 

• 

> 

*» 

7 

u. 

•x 

b— 

Ci 

MM 

U. 

a 

Ui 

«t 

» 

.* 

u 

Z 

7 

* 

_i 

UJ 

XI 

cr 

U  XI 

—  o 

Ui  ft 

X 

»— 

Z 

MM 

ft 

e 

O 

j 

b— 

«• 

c 

X>  Ui 

-J  Ui 

Ui 

_! 

b- 

< 

u. 

X 

n 

X 

X 

ct 

* 

a 

< 

_ 

XI  _i 

U  b~ 

Ui 

z 

• 

X 

X 

7 

U> 

M* 

CL 

. 

X 

Ul 

L- 

< 

» 

XI 

r* 

x  M 

a  <f 

r. 

< 

> 

ft 

lil 

o 

of 

7 

1 

a 

• 

> 

< 

c 

<  1- 

•-i 

UJ 

> 

ft 

L* 

MM 

*- 

ft 

rv 

a 

*-« 

X 

*» 

C* 

-i 

a 

a 

i-i 

T 

X 

ft 

UJ 

b- 

Ui 

a 

c: 

* 

J- 

-J 

< 

V'  VI 

X  o 

u 

f 

*• 

z 

’S 

—  cc 

J 

o  VI 

Ui 

(2 

H 

ft 

* 

z 

_i 

< 

3C 

*  C 

»-  ’/l 

«» 

u 

r 

“* 

7 

>'  Ui 

< 

i 

<T 

■v 

X 

» 

i  r 

> 

Mi 

c 

v  ►- 

a 

u 

c. 

« 

•  * 

x> 

a 

► 

u  < 

1  Li 

UJ 

ft 

X 

Ui 

ft 

a 

ft' 

■c 

VI 

u 

v 

b> 

•i 

z 

-i 

wJ  XI 

b- 

t 

t 

a 

CL 

►- 

ft 

C.  2. 

o 

ft 

o 

• 

~ 

> 

b- 

o 

of 

;  -> 

Ui 

r* 

«' 

a 

< 

•t 

Ui 

ft  < 

* 

? 

o 

7 

<: 

b- 

a 

* 

~i 

xi  a 

a 

c  a 

ft 

u. 

lli 

Ji 

Ui 

1 1 

b- 

-J 

• 

<N 

*-  c 

P 

Ui 

ft 

1* 

C 

mJ 

1“ 

—I 

•J 

z 

ft 

X 

XI 

r 

• 

• 

«f  s 

Z  o 

O 

b- 

IJ 

r- 

V- 

a 

O 

•M4 

L 

ft 

r 

z 

w 

» 

z 

ui 

X 

X) 

Cl  — 

ci 

” 

X 

ft 

a 

Cf 

z 

b~ 

► 

•-« 

•-« 

i 

L 

b- 

XI 

c 

A 

u 

< 

1' 

X' 

ft 

Z 

M 

< 

A 

c 

* 

XI 

(l 

XI 

O 

a  r* 

T 

> 

•i 

•: 

U. 

< 

X 

b- 

X, 

7 

'I 

»T 

z 

* 

o 

XI 

r. 

»— 

b- 

tl 

tr 

V 

X 

Xl 

1 

o  ^ 

• 

a 

r 

X 

41 

T 

> 

■v 

> 

U  Ui 

* 

4‘ 

z 

ft 

b- 

ft 

c 

a 

; 

Cl 

u- 

— • 

z 

♦  a 

UJ 

>- 

b- 

Li 

•— 

p; 

z 

Ui 

ft 

*— 

C 

X 

i 

M 

Ui 

b- 

wb.  XI 

»/}  z 

—  UJ 

*— 

M 

< 

II  ft 

u. 

• 

u 

Ui 

XI 

1 

— 

Ui 

.  — 

If  K" 

ft 

b— 

1 1 

ft 

X 

1 

z 

.. 

Z 

Z 

cr 

1  ( 

b— 

,2 

»•  u* 

z 

z 

u 

I  — 

'  Ji 

z 

Ui 

o' 

«u 

1 

Ut 

«t 

l»i 

X 

> 

U  z 

r 

*-« 

ft 

•— 

J. 

V 

*** 

rr 

b“ 

*; 

(— 

> 

u 

n 

•— 

o 

Cl  C* 

Z> 

»x  «: 

x  -1 

*■-* 

w 

►— 

o 

-J 

X 

2' 

•r 

•) 

** 

'ft 

-J 

# 

•— 

S 

•— 

o 

o 

►—  3 

UJ 

<[  a 

y~ 

* 

Ui 

• 

*1 

O 

M. 

-*  ft 

w 

a 

MM 

A 

M 

i 

- 

J, 

u 

p- 

< 

x» 

►* 

lw  ft 

Xj  < 

u. 

r  . 

J 

z 

» 

cr 

> 

UJ 

X 

fO 

o 

ft 

X 

r* 

b- 

1 

X 

— 

Ci 

XI 

u. 

u. 

< 

X 

- 

•i 

*1 

X  ;i 

z 

a 

A 

U  ’ 

X 

w* 

u 

c 

>  UJ 

1-  < 

c 

X 

*-• 

t— 

1 

1 

1 

Wi  ~ 

ft 

U 

X 

b- 

Ui 

- 

r 

— 

<1 

u> 

r 

a 

*- 

"J 

»: 

b- 

u- 

•— 

r 

O 

z 

i  b- 

1 

1 

ft 

U> 

» 

z 

V* 

w 

«• 

f 

( 

z 

1  — 

* 

f/ 

•Mi 

, 

,, 

< 

O 

or 

X 

1 

C 

, 

a 

ft 

— 

.1 

» 

M 

» 

X 

cf 

ft 

**♦ 

» 

,4 

Mi 

■» 

u 

4'  1 

'N. 

f 

, 

X 

** 

Mi 

- 

* 

< 

■» 

•-• 

X 

<1 

-o 

b- 

*1 

b- 

• 

b— 

O 

_i 

c. 

>•<> 

>— 

— » 

^  a 

z 

t* 

ft 

3 

ft 

<• 

— 

V- 

u 

T 

Xl 

<! 

X 

a 

X 

X 

ft 

• 

X 

“1 

> 

• 

3 

— 

> 

“ 

•” 

*** 

1 

1-i  V 

C 

c 

Ui 

Ui 

o 

U  ui 

C 

C 

Ui 

C 

c 

Ui  u 

C 

C 

c 

c 

c 

c 

Ui 

c 

c 

c 

uuou 

C 

C 

UJ 

Ui 

ui 

Ui 

C 

c 

o 

C 

C 

c 

c 

L 

C 

C 

c 

u 

ft 

■J 

ft  ft 

UJ 

u> 

ft 

Ui 

c 

c 

L 


L. 


186 


n 

u 


n 


L 


Z 

o 


•-*  CO 


cr  <r 

'''zz 


f\»  o  — 

— ‘Z 
11- 
►-  Z  ^ 


ti  it  *i 

C  -*  »NJ 

c*  3  3 

Z  v/>  <7) 

_>  -J  -4 


40  o  ♦  *"*  ♦ 

X  ~  ♦ 

</>  cr  ro 
Z  Z  x  a:  >> 
a  ** 

ft  ft  O  X  H- 

z  a 

m  </i  Z 

*  t- 

♦  ♦  a  ♦ 

rvj  z 

<\j  uj  it* 
X  o  ♦  4  «✓> 

3  —  < 

</i  </>  or  a  x 

-I  -j  z  -j  _f 

a 

N  n  •->  n  u 

us  «.  "*  uj  uj 
Q  2  h  ws. 
—  r  <  i? 

V)  o  X  »  u 

4  JJ.J  J 


c  o. 
Z  -J 
a  w 
Z  *4 


*7 

—  cc 

UJ  o 


X  <71 

U.  -J 

c-  • 
Z  *g  uj 


a  cl 

-  c  c 

7.  O  O 
"JZ 
a  —  u. 

—  rvi  O 

S  U 

Z  -J  X 
UJ  X  •— 
O  »-  V- 


• 

fst 

»  » 

®  •■ 

WO  — 

-4  X 

-J  UJ 

<0  UJ 

—  0 

10  0 

KJ  »— 

-J  z 

sA 

•  — J 

M  _J 

O  X 

►  M 

z 

a  • 

UJ  • 

z  « 

a.  _ 

-J  OJ 

Cw- 

w  _J 

c  ®  — 

x  <n 

-j  3  «/* 

—  v>  J 

*  -j 

»v*  «J  < 

••  w 

—  > 

pg  ^ 

J 

as 

• 

.— .  — 

3  * 

•-1 

CL  «M 

■A  »» 

3  — 

-J  UJ 

♦ 

w  <4  • 

w  vt 

fj  c  •* 

*SJ  < 

f\J 

j  a 

W-* 

<0 

•VI  < 

•— < 

*  -J 

X 

—  JX 

^  w  <► 

—  w 

U‘  - - - 

♦  -4 

•“*  K. 

U) 

I/'  M  w 

♦  ♦  , 

.  X.  X 

— J 

<  *VJ 

CP  ♦  -N, 

3  • 

UJ 

< 

£3  - 

str  cr  a: 

z 

> 

J«»  * 

ustfxi 

->  X  — 

c 

2 

-0  O  — 

a  tu  x  uj  o' 

u>  </) 

c 

j-g  z  < 

r.  a  uj  cl  c 

^  0  «J 

ft 

CL  »- 

c  0  a  0  z 

Z  < 

> 

•*  Z  UJ 

rcfio 

*  —  > 

Cl 

m 

S  -J  0 

z  c  z  ♦ 

7  _J  -J 

C 

•'-*■««< 

♦  z 

♦ 

Cn*. 

♦  *  < 

O  K*  ISI 

c£ 

»«g 

UJ  ♦  t- 

•— 

• 

t/> 

a  *4  3  ^ 

— *  •>  ^ 

l7.  — ■  *. 

C  x  cnj  r  x 

>  '*  - 

or 

< 

uj  rg  — * 

O  </)  X  _j  .  J 

a;  a.  rc 

UJ 

> 

->  CC  3 

-1  -j 

0  X  z 

7 

3 

x  U>  a. 

J  II  u 

z  0  a 

*-* 

<  -J-l 

e  it 

li  <  «/» 
-•eg  >~  _j 
x  cr  3  uj  < 
^  ^  y  i  > 


CM 
l/'  X 
I  Wf 
1  OC  Z 
‘  J  fl  1 
t 

l  c  * 


<  r  ~  Z  r- 


< 
a 
•  a 

_  ►- 

■  W  w  w  I/O 


r\t  rg  ' 

Z  Z  t 


3 

o 


z 

•  C#  U)  ♦ 
ui  X 

*  •  a  — 

_J  <7i 

j  o  r  o 

«  i,  a  <  r  -  x  i 

LU  J  J  J  J 

V  —  s/> 

X 
U.  or 


<  <C  x  u.  I.  cc 


<M  rsl 

4  ►- 

<N  x  t/i 

►  < 

*  -J  -J 

■J  z  c* 

V  <71  li,  ->  C,  ■ 

^  jC-C, 

•  o  ~ 

>  U-  4-  • 

»2  U*  «  * 

-l  X'  i 

♦  3  •  X  • 

'  _J  >— 

«  o  i/> 

— I  !l  ■—  • 

U.  -J  « 
H  N“ 

>-  —  II 

-J  ’71 

3  <  u.  ki 


o 
o  C 
uj  <r 
w 

^  C 
H 
<7> 

UJ  Q 


u*  <  a: 

►-  y  — 

—  a 


uuu 


o  o  o 


UJ  a 
I-  5 
2  3 
—  O 


z 

< 

D 


l O  w 

3  -4 

5.  ♦  JL 
C  O  U. 

t 1  U.  Z 
^  X  O 
-J  Z  t- 


a 

uj  v— 

7  u 


-m  C*J  —N 

X  II 

Hit-  3  0  0 

»-  7  r  r 

x  u.  «✓>  ,-r  —  — 

X  *-  -»  *-  3  2  3. 

*t  c  ^U.IbU 

;  2  x  i-  a  c*  t 


1*0  II  o 


C  a  ?  O  -  C 
7  h  ’*j  3  a  M 
x  t/i  z  o  a.  uj 

z  ►*  *-  —  —  ~ 


o 

r 


X  O 

o 

o 


II  II  UJ 

3 

~  —  z 


m  c 

*  ii 


>  o  z  » 
z  ~  : 

‘  S  *-  ; 


o 

u.  c 
z  c. 
— .  o 


•  a  < 

<  a. 

uj  a 


r,^g 

2 


u  tu 
u  a 
or  — 

—  c* 
o  z 
Z  x 


C  IL 

<  lii 
UJ  a 


!  Z  —  U»  — 

—  **  3  - 

■: 

<  -  ,-t  >-  i 

t  ’<  j  x 

:  o  c 

:  o  u. 

-*  o 


o 

n 

cr 

O 


</>  *j*  * 

x  ♦  ►-  O  . 

O  Z  c 

-»  a  rr  *-■  * 


O  M  O'  -  *•  I 
*-  **  z  _j  ^  . 


<_>  cj  i_>  u> 


189 


n 


7 

-  / 

(SI  — 


u, 't' 

-J  • 


u  * 

7.  —  — 

0 c 

o  -  — 

—  X  IV 

—  a 

-  o  o 

- I  o 


u,  ^ 

£-S 

V*  *  •/» 
~  ~  < 
-*  X. 


w  «-  * 
V  o  ~ 
c*  —  -* 

cm  O 
*  J2! 

>.  x  — 


i  (C  c 

-  o  —  ^  ^ 

o  a;  —  3  "•  •“* 

J  AJ 

X  ^  vT  rs* 

Ctf  X 

<  unv 
>  O  O 


■  D  *-  *“ 

'  a  z  z 


•  —  a? 

<M  —  VI 
CT  f\J  — •  — • 


.—  X 


*  a 


u,  <2  r, 
a  ^ 


O 

o 


u 

u. 

7 


-  •  3. 

UJ  —  O  • 
O  hOO 
c;  w  «.  o 
OX  C* 


0.0  *  C 

°  °  r  ° 


—  n  —  a 


—  —  c 

*A  —  •  — 

2  — 
O  oc  ^ 

*  —  ►-  CC 


r  fvj 

*  ^ 


o 

2 


cO 


»r  x  —  *  -4  o 

Jl  ^  M  O 

0,0  •  O 

c  „•  ♦  i-  r  >  ^ 

—  —  o  - 

t/»  -4  -*  •  •'Nn 

m  N  •{  N  N  N  C 

* 

HUM'-*** 

^  rj  u.  n  ^  N 

-MOO 


o 

7 


X  * 
•  fv 


w 

X  — 

’-A-  X 

•£■  c* 
—  c. 


—  o 
-  -•  a 

•  —  -V  C  o 

i  J  C  2 

;ir£  o 

,  u  I/1  W  • 
iacf  c  c# 
,  C  <  7  «- 


£  r;rs« 

-  -  / 

C  •  v/'  o  ^ 
^  -J  + 

V  *-J  - 

it;  v'  'f  -n 


>  ~  — 


$ 

c 


c- 


o 


o 

s 


2:  8 


o  o  u  o  o  «-> 


SJ  <_>  o  o  o 


»  i/»  I  Cf  I 


•  O  I 

*  o 


>  N 
I  I  N 
I  o  ® 


•  to  i  >  I  X  <*• 


I  I  m  o 


I  it  c 

—  u  a 

—  a  — • 


i  2  i/' 

j  « 


O  — 
L.  «1 


»  -J  o 

I  *1  * 

I  04  c 

t  X  O 


*  j  i  ►-  i  -  r 


»  o 
«  .* 


r  I  “  l  J  li  J 

f  I  I  C.  7  « 

i  n  i  r  c  L  o1 

(  to  i  C  »  —  O  a 


o  u  o 


•  u; 

•  r. i 

»  r: 

•  c 


o 

< 


c  ~ 
•  o 

o  • 


w.  ~  XV 


c  ~ 

K 
I*  — 

•SI  U. 


tO  id 
^  V' 


—  V  V 

V  —  U  M  7. 

X  t 

«.  -  r  7  3 

,  V  •-  < 

S  L  ”  C  ^  ■ 

k-  -  1/  u  i.  1 


UJ  (J 
3  Uh 

<f  —  o 

C  >-  u;  2  > 
iT  C  -J<  » 
3  >  a'  < 

O  o  <  *  c 

C  O  y  x  £ 

Ul  ^  x  0 

I  —  c  *-  <o 

y-  <i 


m2  O.  7  V*  <* 

O  7  O  ✓>  or 

'  «  uj  " 

<  »-  r  a.  . 

:«iis?T 

/  L-  >  li  U>U 

'J-tf  -  C  3 
_  l 

y~  y.  ✓>  —  *-; 
<<7  7  • 

J  /  ~wC  ^ 
a  c_  i  ^-' 

C  j  >L  *•  S» 

u.  <x  ►*  tJ  • 
r  «t  >“  ►*  a 
•-  u,  x  u.  < 

iuU^Sm- 

X  .o  ^  -  < 

^  o,  C  > 

\f\  Ul  > 

v  *x  *  « 

o  *”  v 
2/  r  >  r 
j  r  w  .  ^ 
UJ  C  K  **  £<• 

7  u  " 

-  u.  %  u  ^ 

i-  >  :  i 

3  ^  «J  ►-  L 

O  4i'  a  uj  ■“ 
a  ^  a  ^  » 

-  *t  -  - 

^  <  *-  y  ' 

«  l-  ai  ~  ‘I  x 

I  t)  i  C  »*  T 

►-  c  •-  o  u  *■' 


OOSJSJSJOOO 


•  tt  M 
O  O  w 
-20 


>  to  —  * 

#  ^  — 
O  2  •  Cl 

a  uiv 
-  7.  C. .  -J 
~  CO 

>  *  a  vo 

I  U)  /  •• 

<.;  c 

”  l/)  v3  Vi 

I  7  J  T 

«f  a 

~  -  >  M 
H  7 

r  >  * 

»  c/>  -  o 

j  a  ^  — 

*1  >*•  2  o. 

uiZO- 


-  to  V 
—  Of  *  ►- 
i'  j.  a 
i  rr  x 

w  -  ^  * 

<7  ^  < 
^*-77 


IL. 


4-i 


192 


Ll 

y 

L 

I 

I. 

L 


L 


ci”*wt.  /nxn/  ipts,  irs.  nt*s<«i.  usme,  abnds,  imic,,  n,  i  *  “um,  ipeta«d,  “apiu 

1°<INT.  NCFN,  NPS,  R,  'JFONS,  AVALS,  rOCPE,  lew,  MFR,  |  K  F  AL*  B  VALS(3,1I 

LAST,  LEf'L,  LF\'i,  KPTS,  I  F  Iti  ,  IRHS,  .SOUS),  I0*i('  (20 , 3 1  I C - 


Inverse  Iteration  Using  a 
Band  Linear  Equation  Solver 


Inverse  Iteration  Using  a 
Symmetric  Indefinite  Linear  Equation 
Solver  Based  on  the  Work  of  J.  R,  Bunch 


Appendix  C;  Sample  Deck  Set-ups  and  Runs. 


The  following  pages  contain  deck  set-ups  and  the  output  of  the  resulting 
runs  for  a  sample  problem.  The  runs  were  made  on  an  IBM  360/91  at  the  Stanford 
Linear  Accelerator  Center.  All  the  cards  with  "//"  or  "/*"  in  columns  1-2  are 
OS/36O  job  control  language  cards,  and  do  not  change  from  problem  to  problem. 

Thus  the  actual  required  input  is  rather  small.  For  information  about  the  in¬ 
put  parameters,  see  the  comments  at  the  beginning  of  each  of  the  program  modules. 
Extensive  use  is  made  of  the  namelist  feature  of  the  IBM  Fortran  language  to 
avoid  the  rigidity  of  formatted  input. 

Object  modules  for  Phase  1,  Phase  2,  and  Phase  3  are  contained  in  the  data 
sets  PUB . JAG . POi ,  PUB.JAG.P02,  and  PUB.JAG.TMP,  respectively. 

The  sample  problem  is  the  following: 

o  +  u  =  k  in  (0,1)  x  (0,1) 

u  =  x2  +  y2  on  x  =  0,1,  0  <  y  <  1, 

y  =  0,1,  0  <  x  <  1. 

2  2 

The  solution  to  this  problem  is  u  =  x  +  y  . 

The  first  run  solves  the  problem  using  piecewise  quadratics  (element  2-6;, 
and  the  second  run  uses  piecewise  cubics  (element  3~*0-  In  both  cases  the 
error  in  the  parameters  is  at  rounding  error  level,  as  is  to  be  expected. 

'The  final  two  pages  of  this  Appendix  contain  a  deck  set-up  for  an  eigen¬ 
value  problem.  The  deck  listed  is  the  one  used  to  produce  the  quintic  entry 
in  Table  5.2.2. 
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Deck  Set-up  for  Sample  Problem. 


// JAGXXTST  JOB  1 JAG$CG?,54/CLASS=E/REG!ON=3O0K 

/  /  STP1  EXEC  LOADGO/PARt  .  G0=  '  S  !  ZE  =  288r>00  1 

//GO. SYSL1N2  DO  DSNAfiE=PUB. JAG. P01, D I SP=OLD/ UNI  1=2314, 

//  VOLUME5SER  =  PUBOQ  1 

//GO. FT01F001  DD  DSNAME=JAGCG. OUT1, UNI T=SYSPA/ D! SP=( NEW, PASS) , 
//  SPACE=(CYL/(1/1)/RLSE) 

//GO . SYS ! N  DD  * 

&PARI.S  NDIVS  =  2/NPS  =  1/MCEN=0/  LAST  =  1,  !  BUG  =  0/  SEND 
&  POINTS  PT(l)  =  (0/0)/PT(2)  =  (l/0)/PT(3)  =  (l/l)/PT(4)s(0/l)  SEND 
&TR  NODES  =  l,2,3,  BND<  1)  =  1,  BND(  2)  =  2  SEND 
STR  NODES5 1, 3, 4,  BND(2)=3/  BND(3)=4,  ENDTR=T  SEND 
/* 

//STP2  EXEC  LOADGO, PARM.GO5 ' S I ZE5288O0O  ' 

/ / GO . SY S L I N 2  DD  DSNAME=PUB . JAG. P02, D 1 SP=OLD, 

//  UN! T=2314, VOLUME=SER=PUB003 

//GO.FTOlFOOl  DD  DSNAM£5J AGCG . OUT1, D I SP= (OLD, PASS) , UN  I T  =  SYSDA 
//GO. FT02F001  DD  DSNAHE=JAGCG .OUT2, D I SP=( NEW, PASS) , UN  1 T=SYSDA/ 
//  SPACE=(CYL,(2/  1) ,RLSE) 

//GO . SYS  1 N  DD  * 

SPAR  MS  I BUG=0, IDEG=2,  NCP=1,  ICP(1)  =  1,  NSP(1)  =  1,  ISP(1,1)  =  1, 
IRHS52,UX2=1/UY251,U2=0/ IEIG=0  SEND 

/* 

//STP3  EXEC  FORTHLG 

/ /  LKED .  JAGP03  DD  DSNAME=PUB.  -JAG. TKP,  Dl  SP=OLP,  UN!  T5  2314, 

//  VOLUME=SER=PUBOO 1 

//LKED . SYS ! N  DD  * 

INCLUDE  JAGP05 
/* 

//GO.  FT01F001  DD  DS  NAM  E=-J  AGCG .  OUT  1,  UNI  T=SYSDA,  D I  SP=  (OLD,  DELETE) 
//GO. FT02F001  DD  DSNAME=JAGCG. OUT2, UN  I T=SYSPA/ D I SP5 (OLD, DELETE) 
//GO. FT03F001  DD  DSNAKE=JAGCG . OUT3, UNI T=SYSDA, D I SP= ( NEW, PASS 7 
//  SPACE5  C CYL/  ( 1, 1)  /  RISE) 

//GO. SYS  I N  DD  * 

SPARMS  N8NDS54, I  PRINT5!, IS0L(1)=10,  SEND 
1  10 
2  10 

3  10 

4  10 

/* 
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hEFINE  JUNE.  TIME  USED:  0.016  SECONDS.  ACCUMULATED  TIME:  0.199  SECONDS 


These  cards  and  input  solve  the  sample  problem 
using  piece-wise  cubics  (element  3-4).  The  output 
from  this  run  appears  on  the  following  pages. 


// JAGXXTST  JOB  ' JAG$CG ! , 54,CLASS  =  E,  REG  I ON=300K 
//STP1  EXEC  LOADGO,  PARM. GG=’ SIZE* 288000' 

//GO. SYS LI N2  DC  DSNAME=PUB.  cAG . P01/ 0  I SP=OLD, UN  I T= 2314, 

//  VOLUME* S E R=  PUBO 01 

//GO. FT01F001  OD  DSNAME* JAGCG .ObTl/  UN  I T=SYSDA,  D I SP=(NEW/  PASS) , 
//  SPACE=(CYL, (1,1),RLSE) 

//GO. SYS  I N  DO  * 

ft PAR MS  NDIVS=3,NPS=0,NCEN=1,LAST=1, IBUG=0,  ftEND 
ft.PO  I  NTS  PT(l)=(0,0),PT(2)=(l,0),PT(3)=(l,l),PT(4)=(0,l)  ftEND 
f.TR  NODES=  1,  2, 3,  BND(1)  =  1,  BND(2)  =  2  SEND 
STR  NODES  =  l,3,4,  BND(2)=3,  BND(3)=4,  ENDTR=T  SEND 
/* 

//STP2  EXEC  LOADGC, PARM. G0= 'SIZE= 288000* 

//GO.SYSLIN2  DO  DSNAME=PUB.  JAG.  P02/DI  SP=OLP/ 

//  UN  I  T=  2314,  VOLUf  :E=SER=P1JB003 

//GO.  FT01F001  DD  DSNAf'E=JAGCG .  OUT1,  D 1  SP=  (OLD,  P  ASS)  ,  UN  1 7=SYSDA 
//GO. FT02F001  DD  DSNAME=JAGCG. OUT2, D I SP=(  NEV  , P  ASS) , UN  I T=SYSDA, 
//  SPACE=(CYL, (2,1),RLSE) 

//GO. SYS  IN  DD  * 

SP ARMS  I  BUG=0,  I DEG=3, NCP=5,  ICP=1,  2,3, 

I RHS*  2, DX  2  =  1, UY  2- 1, U2=0, IEIG=0  SEND 

/* 

// STP3  EXEC  FORTHLG 

//  I.KED.  JAGP03  DD  DSNAME=  PUB .  JAG .  TMP,  D I  SP=OLD,  UN  I  T=  2314, 

//  VOLUhE=SER=PUB001 

// LKED. SYS  I N  DD  * 

INCLUDE  JAGP03 
/* 

//GC. FT01F001  DD  DSNAME=J  AGCG. OUT1, UN  I T=SYSDA, D I SP= (OLD, DELETE ) 
//GO. FT02F001  DD  DSNAME* JAGCG . OUT2, UN  I T=SYSDA, D I SP  =  ( OLD, DELETE ) 
//GO. FT03F001  DD  DSNAME= J  AGCG . OUT3, UN  I T=SYSD A, D I SP= ( NEW,  P ASS) , 
//  SPACE* (CYL, (1, 1),RLSE) 

/ /GO. SYS  I N  DD  * 

SPARMS  NBNDS=4, IPRIMT=1, ISOL(1)=10, ISOL(2)=ll, I SOL( 3 ) = 11, SEND 
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COORDINATES:  0.33333J3  C.  3333333  VALUE:  0.22222222  TYPE:  -l  TRUE  VALUE:  0.22222222  ERROR:  C.00000000 

VALUE:  0.66666667  TYPE:  -2  TRUE  VALUE:  0.66666667  ERROR".  C.  OOOCOOOO 

VALUE:  0.666  6  6  6  6  7  TYPE:  -3  TRUE  VALUE:  0.66666667  ERROP  :  C.  OOOOOOCO 

COORDINATES:  0.3333333  0.  C  VALUE:  0.11111111  TYPE:  1 


These  cards  and  input  produced  the  quintlc  entry  in 
table  5.2.2.  Note  that  the  object  decks  for  the 
inverse  iteration  code  using  Bunch's  symmetric  solver 
are  stored  in  the  data  set  PUB. JAG. I NV. 

//  UAGXXHL5  JOB  '  JAG$CG '  ,  54,  CLASS=E,  REG!  ON=300K 
//STP1  EXEC  LOADGO, PARM.GO® 'SIZE® 288000' 

//G0.SYSUN2  DD  DSNAME*  PUB.  JAG.  P01,  D I  SP-OLD,  UN  I  T=2314, 

//  VOLUME=SER=PUB001 

//GO.FT01F001  DD  DSNAME=JAGCG.0UT1,UNIT=SYSDA, Dl SP=( NEW, PASS), 
//  SPACE® ( CYL, ( 1/ 1)/ RLSE) 

//GO. SYS  I N  DD  * 

& PA RMS  NDt  VS=1,  NPS=4,  NCEN=6,  1451=1,  IBUG=0,  &END 
&POINTS  PT(1)  =  (0,0),  PT( 2)=( 1, 0) ,  PT(3)*<2,0),  PT(4)  =  (2,1), 
PT(5)=(1.3,.7),  PT  (  6 )  =  (  1,  .  7  ) ,  PT( 7)=( . 6, . 6 ) ,  PT(8)=(.9, .9), 
PT(9)=(1.1,.9),  PT(10)=(1. 2, 1),  PT(11)=(1, 1)  SEND 
&TR  NODES=l,2,7,  BND(1)  =  1  &END 
&TR  NODES=7,2,6  &END 
&TR  NODES=6,  2,5,  &END 
&TR  NODES®  2, 3, 5,  BND(1)  =  1  &END 
&TR  N0DES=3, 4, 5,  BND(1)=1  &END 
&TR  NODES=7, 6, 8  &END 
&TR  NODES=6, 9, 8  SEND 
&TR  NODF.S=6,  5,  9  &END 
&TR  NODES=8,9, 11  SEND 
&TR  NODES=9, 10, 11,  BND(2)=1  &END 
&TR  NODES=9, 5, 10  SEND 

&TR  NODES=5, 4,10,  BND(2)=1,  ENDTR=T  SEND 
/* 

//STP2  EXEC  LOADGO, PARM.GO*' SI ZE=288000 ' 

//G0.SYSL1N2  DD  DSNAME=PUB. JAG. P02, Dl SP=OLD, 

//  UNI T=  2314, VOLUME=SER=PUB003 

//GO.FT01F001  DD  DSNAME=JAGCG.0UT1, Dl SP=(OLD, PASS) , UN!T=SYSDA 
//GO.  FT02F001  DD  DSNAME»-J  AGCG.  OUT2,  D I  SP=  ( NEW,  PASS)  ,  UN  I  T=SYSDA, 
//  SPACE=(CYL,  (1,1), RLSE) 

//GO. SYS  IN  DD  *  /n%  , 

&PARMS  I BUG=0,  I DEG=5,  NCP=1,  !CP(1)=1,  NSP(1)=1,  NSP(2)=1, 
nsp(  3)  =  1,  NSP(  4)  =  1,  ISP(1,1)  =  1,  ISP(2,1)  =  1,  ISP(3,1)  =  1, 
i  sp(  4,  1)  =  1,  I  RHS=0,  UX2=  1,  UY2*1,  U2=0,  I  E I  G=  1  SEND 

/* 


/ / STP 3  EXEC  FORTHLG 

// LKED. JAGP03  DO  DSNAME= PUB. JAG. TMP, D I SP=OLD, UN  I T=2314, 

//  VOLUME=SER=PUBOO 1 

/ / LKED. SYS  I N  DD  * 

INCLUDE  JAGP03 
/* 

//GO. FT01F001  DD  DSNAME=JAGCG. OUT1, UN  I T=SYSDA, Dl SP=(OLD/ DELETF) 
//GO.  FT02F001  DD  DSNAME=JAGCG.  OUT2,  UN  I  T=SYSDA,  D I  SP=(OLD, DELETE) 
//GO. FT03F001  DD  DSNAME=J AGCG. OUT3, UN  I T=SYSPA,  D I SP= ( NEW,  PASS) , 
//  SPACE=  ( CYL,  (1,  1),RLSE) 

//GO . SYS  I N  DD  * 

SPARES  NBMDS=1,  I BUG=0  &END 
1  5 

/* 

// STP5  EXEC  FORTHLG 

//LKED. JAGP4  DD  DSNAME=PU8. JAG. I NV, Dl SP=OLD/ UN  1 1=2314, 

//  VOLUME=SER=PUBOO 1 

//LKED. SYS! N  DD  * 

INCLUDE  JAGP4 

/  * 

//GO. FT02F001  DD  DSNAME-JAGCG.OUT3, UN  I T=SYSDA/ D I SP=( OLD, DELETE ) 
//GO. SYS  I N  DD  * 

&PARMS  SHIFT-9.6  &ENO 
& PA RMS  SHIFT»-1  SEND 
/* 
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